Reinforcement Learning: An Introduction

Provides a clear and simple account of the key ideas and algorithms of reinforcement learning. Familiarity with elementary concepts of probability is assumed.

**Publication date**: 01 Mar 1998

**ISBN-10**:
0262193981

**ISBN-13**:
9780262193986

**Paperback**:
322 pages

**Views**: 16,075

Reinforcement Learning: An Introduction

Provides a clear and simple account of the key ideas and algorithms of reinforcement learning. Familiarity with elementary concepts of probability is assumed.

Book Description:

Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.

The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.

Reviews:

Amazon.com

:) "The book is very readable by average computer students. Possibly the only difficult one is chapter 8, which deals with some neural network concepts. I highly recommend this book to anyone who wants to learn about this subject. "

:) "The book is easy and interesting to read. The diagrams, especially those on TD, throw a great deal of insight on the basic concept of TD. The intuitive ideas behind RL are developed clearly. At the same time all the fundamental concepts are made mathematically precise using very simple language and notation. Anybody new to RL should find this book extremely useful."

Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.

The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.

Reviews:

Amazon.com

:) "The book is very readable by average computer students. Possibly the only difficult one is chapter 8, which deals with some neural network concepts. I highly recommend this book to anyone who wants to learn about this subject. "

:) "The book is easy and interesting to read. The diagrams, especially those on TD, throw a great deal of insight on the basic concept of TD. The intuitive ideas behind RL are developed clearly. At the same time all the fundamental concepts are made mathematically precise using very simple language and notation. Anybody new to RL should find this book extremely useful."

Tweet

About The Author(s)

Andrew Barto is Professor Emeritus in the College of Information and Computer Sciences at University of Massachusetts Amherst. He is a Co-Director at Autonomous Learning Laboratory. His research interests are theory and application of methods for learning and planning in stochastic sequential decision problems; algebraic approaches to abstraction; psychology, neuroscience, and computational theory of motivation, reward, and addiction; computational models of learning and adaptation in animal motor control systems.

Richard S. Sutton is Professor and iCORE chair Department of Computing Science at University of Alberta. Dr. Sutton is considered one of the founding fathers of modern computational reinforcement learning, having several significant contributions to the field, including temporal difference learning, policy gradient methods, the Dyna architecture.

Book Categories

Computer Science
40
Introduction to Computer Science
40
Algorithms and Data Structures
24
Object Oriented Programming
21
Theory of Computation
19
Formal Methods
18
Functional Programming
10
Logic Programming
23
Artificial Intelligence
22
Computer Vision
10
Big Data
3
Neural Networks
19
Compiler Design and Construction
16
Computer Organization and Architecture
9
Parallel Computing
3
Concurrent Programming
22
Operating Systems
22
Data Communication and Networks
29
Information Security
6
Information Theory
23
Digital Libraries
14
Information Systems
61
Software Engineering
17
Game Development and Multimedia
10
Data Mining
21
Machine Learning

Mathematics
65
Mathematics
2
Precalculus
9
Algebra
6
Calculus
5
Category Theory
25
Linear Algebra
16
Computer Aided Mathematics
5
Proofs
15
Discrete Mathematics
6
Numerical Methods
3
Number Theory
10
Graph Theory
11
Operations Research
1
Complex Analysis
5
Queueing Theory
31
Statistics
9
Probability

Supporting Fields
9
Electric Circuits
22
Signal Processing
14
Web Design and Development
9
Database Management System
1
Cloud Computing

Operating System
Programming/Scripting
6
Ada
12
Assembly
34
C / C++
8
Common Lisp
2
Forth
34
Java
12
JavaScript
1
Lua
16
Microsoft .NET
12
Perl
5
PHP
55
Python
1
Rebol
13
Ruby
2
Scheme
3
Tcl/Tk

Miscellaneous