MARC details
000 -LEADER |
fixed length control field |
06548cam a2200793 i 4500 |
001 - CONTROL NUMBER |
control field |
on1096525137 |
003 - CONTROL NUMBER IDENTIFIER |
control field |
OCoLC |
005 - DATE AND TIME OF LATEST TRANSACTION |
control field |
20220712085959.0 |
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS--GENERAL INFORMATION |
fixed length control field |
m o d |
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION |
fixed length control field |
cr cnu---unuuu |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION |
fixed length control field |
190413s2019 enk o 000 0 eng d |
040 ## - CATALOGING SOURCE |
Original cataloging agency |
EBLCP |
Language of cataloging |
eng |
Description conventions |
pn |
Transcribing agency |
EBLCP |
Modifying agency |
TEFOD |
-- |
OCLCF |
-- |
OCLCQ |
-- |
UKAHL |
-- |
OCLCQ |
-- |
N$T |
-- |
OCLCQ |
-- |
NLW |
-- |
K6U |
-- |
OCLCO |
-- |
UKMGB |
-- |
OCLCO |
015 ## - NATIONAL BIBLIOGRAPHY NUMBER |
National bibliography number |
GBC216935 |
Source |
bnb |
016 7# - NATIONAL BIBLIOGRAPHIC AGENCY CONTROL NUMBER |
Record control number |
019365464 |
Source |
Uk |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
1789533449 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
9781789533446 |
Qualifying information |
(electronic bk.) |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
Cancelled/invalid ISBN |
9781789533583 |
Qualifying information |
print |
029 1# - (OCLC) |
OCLC library identifier |
AU@ |
System control number |
000065314501 |
029 1# - (OCLC) |
OCLC library identifier |
UKMGB |
System control number |
019365464 |
029 1# - (OCLC) |
OCLC library identifier |
AU@ |
System control number |
000070535697 |
035 ## - SYSTEM CONTROL NUMBER |
System control number |
(OCoLC)1096525137 |
037 ## - SOURCE OF ACQUISITION |
Stock number |
17D228CA-B9A5-47F9-8400-6F06CA49CCCE |
Source of stock number/acquisition |
OverDrive, Inc. |
Note |
http://www.overdrive.com |
050 #4 - LIBRARY OF CONGRESS CALL NUMBER |
Classification number |
QA76.73.P98 |
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER |
Classification number |
005.133 |
Edition number |
23 |
049 ## - LOCAL HOLDINGS (OCLC) |
Holding library |
MAIN |
100 1# - MAIN ENTRY--PERSONAL NAME |
Personal name |
Balakrishnan, Kaushik. |
9 (RLIN) |
829423 |
245 10 - TITLE STATEMENT |
Title |
TensorFlow Reinforcement Learning Quick Start Guide : |
Remainder of title |
Get up and Running with Training and Deploying Intelligent, Self-Learning Agents Using Python. |
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT) |
Place of publication, distribution, etc |
Birmingham : |
Name of publisher, distributor, etc |
Packt Publishing Ltd, |
Date of publication, distribution, etc |
2019. |
300 ## - PHYSICAL DESCRIPTION |
Extent |
1 online resource (175 pages) |
336 ## - |
-- |
text |
-- |
txt |
-- |
rdacontent |
337 ## - |
-- |
computer |
-- |
c |
-- |
rdamedia |
338 ## - |
-- |
online resource |
-- |
cr |
-- |
rdacarrier |
505 0# - FORMATTED CONTENTS NOTE |
Formatted contents note |
Cover; Title Page; Copyright and Credits; Dedication; About Packt; Contributors; Table of Contents; Preface; Chapter 1: Up and Running with Reinforcement Learning; Why RL?; Formulating the RL problem; The relationship between an agent and its environment; Defining the states of the agent; Defining the actions of the agent; Understanding policy, value, and advantage functions; Identifying episodes; Identifying reward functions and the concept of discounted rewards; Rewards; Learning the Markov decision process ; Defining the Bellman equation; On-policy versus off-policy learning |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
On-policy methodOff-policy method; Model-free and model-based training; Algorithms covered in this book; Summary; Questions; Further reading; Chapter 2: Temporal Difference, SARSA, and Q-Learning; Technical requirements; Understanding TD learning; Relation between the value functions and state; Understanding SARSA and Q-Learning ; Learning SARSA ; Understanding Q-learning; Cliff walking and grid world problems; Cliff walking with SARSA; Cliff walking with Q-learning; Grid world with SARSA; Summary; Further reading; Chapter 3: Deep Q-Network; Technical requirements |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
Learning the theory behind a DQNUnderstanding target networks; Learning about replay buffer; Getting introduced to the Atari environment; Summary of Atari games; Pong; Breakout; Space Invaders; LunarLander; The Arcade Learning Environment ; Coding a DQN in TensorFlow; Using the model.py file; Using the funcs.py file; Using the dqn.py file; Evaluating the performance of the DQN on Atari Breakout; Summary; Questions; Further reading; Chapter 4: Double DQN, Dueling Architectures, and Rainbow; Technical requirements; Understanding Double DQN ; Coding DDQN and training to play Atari Breakout |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
Evaluating the performance of DDQN on Atari BreakoutUnderstanding dueling network architectures; Coding dueling network architecture and training it to play Atari Breakout; Combining V and A to obtain Q; Evaluating the performance of dueling architectures on Atari Breakout ; Understanding Rainbow networks; DQN improvements; Prioritized experience replay ; Multi-step learning; Distributional RL; Noisy nets; Running a Rainbow network on Dopamine; Rainbow using Dopamine; Summary; Questions; Further reading; Chapter 5: Deep Deterministic Policy Gradient; Technical requirements |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
Actor-Critic algorithms and policy gradientsPolicy gradient; Deep Deterministic Policy Gradient; Coding ddpg.py; Coding AandC.py; Coding TrainOrTest.py; Coding replay_buffer.py; Training and testing the DDPG on Pendulum-v0; Summary; Questions; Further reading; Chapter 6: Asynchronous Methods -- A3C and A2C; Technical requirements; The A3C algorithm; Loss functions; CartPole and LunarLander; CartPole; LunarLander; The A3C algorithm applied to CartPole; Coding cartpole.py; Coding a3c.py; The AC class; The Worker() class; Coding utils.py; Training on CartPole |
500 ## - GENERAL NOTE |
General note |
The A3C algorithm applied to LunarLander |
520 ## - SUMMARY, ETC. |
Summary, etc |
This book is an essential guide for anyone interested in Reinforcement Learning. The book provides an actionable reference for Reinforcement Learning algorithms and their applications using TensorFlow and Python. It will help readers leverage the power of algorithms such as Deep Q-Network (DQN), Deep Deterministic Policy Gradients (DDPG), and ... |
588 0# - |
-- |
Print version record. |
504 ## - BIBLIOGRAPHY, ETC. NOTE |
Bibliography, etc |
Includes bibliographical references. |
590 ## - LOCAL NOTE (RLIN) |
Local note |
eBooks on EBSCOhost |
Provenance (VM) [OBSOLETE] |
EBSCO eBook Subscription Academic Collection - Worldwide |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Python (Computer program language) |
9 (RLIN) |
63272 |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Artificial intelligence. |
9 (RLIN) |
2458 |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Machine learning. |
9 (RLIN) |
90912 |
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Python (Langage de programmation) |
9 (RLIN) |
917996 |
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Intelligence artificielle. |
9 (RLIN) |
869534 |
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Apprentissage automatique. |
9 (RLIN) |
869505 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
artificial intelligence. |
Source of heading or term |
aat |
9 (RLIN) |
2458 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Database design & theory. |
Source of heading or term |
bicssc |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Mathematical theory of computation. |
Source of heading or term |
bicssc |
9 (RLIN) |
855105 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Machine learning. |
Source of heading or term |
bicssc |
9 (RLIN) |
90912 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Information architecture. |
Source of heading or term |
bicssc |
9 (RLIN) |
1034734 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Artificial intelligence. |
Source of heading or term |
bicssc |
9 (RLIN) |
2458 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Computers |
General subdivision |
Machine Theory. |
Source of heading or term |
bisacsh |
9 (RLIN) |
38226 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Computers |
General subdivision |
Data Modeling & Design. |
Source of heading or term |
bisacsh |
9 (RLIN) |
942725 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Computers |
General subdivision |
Intelligence (AI) & Semantics. |
Source of heading or term |
bisacsh |
9 (RLIN) |
855104 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Artificial intelligence. |
Source of heading or term |
fast |
-- |
(OCoLC)fst00817247 |
9 (RLIN) |
2458 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Machine learning. |
Source of heading or term |
fast |
-- |
(OCoLC)fst01004795 |
9 (RLIN) |
90912 |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Python (Computer program language) |
Source of heading or term |
fast |
-- |
(OCoLC)fst01084736 |
9 (RLIN) |
63272 |
655 #0 - INDEX TERM--GENRE/FORM |
Genre/form data or focus term |
Electronic books. |
655 #4 - INDEX TERM--GENRE/FORM |
Genre/form data or focus term |
Electronic books. |
776 08 - ADDITIONAL PHYSICAL FORM ENTRY |
Display text |
Print version: |
Main entry heading |
Balakrishnan, Kaushik. |
Title |
TensorFlow Reinforcement Learning Quick Start Guide : Get up and Running with Training and Deploying Intelligent, Self-Learning Agents Using Python. |
Place, publisher, and date of publication |
Birmingham : Packt Publishing Ltd, ©2019 |
International Standard Book Number |
9781789533583 |
856 40 - ELECTRONIC LOCATION AND ACCESS |
Uniform Resource Identifier |
<a href="https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2094787">https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2094787</a> |
938 ## - |
-- |
Askews and Holts Library Services |
-- |
ASKH |
-- |
AH36155814 |
938 ## - |
-- |
ProQuest Ebook Central |
-- |
EBLB |
-- |
EBL5744473 |
938 ## - |
-- |
EBSCOhost |
-- |
EBSC |
-- |
2094787 |
994 ## - |
-- |
92 |
-- |
INOPJ |