Reinforcement Learning: An Introduction

Provides a clear and simple account of the key ideas and algorithms of reinforcement learning. Familiarity with elementary concepts of probability is assumed.

**Publication date**: 01 Mar 1998

**ISBN-10**: 0262193981

**ISBN-13**: 9780262193986

**Paperback**: 322 pages

Book Description:

Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.

The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.

Reviews:

Amazon.com

"The book is very readable by average computer students. Possibly the only difficult one is chapter 8, which deals with some neural network concepts. I highly recommend this book to anyone who wants to learn about this subject."

"The book is easy and interesting to read. The diagrams, especially those on TD, throw a great deal of insight on the basic concept of TD. The intuitive ideas behind RL are developed clearly. At the same time all the fundamental concepts are made mathematically precise using very simple language and notation. Anybody new to RL should find this book extremely useful."

About The Author(s)

Andrew Barto is Professor Emeritus in the College of Information and Computer Sciences at the University of Massachusetts Amherst. He is Co-Director of the Autonomous Learning Laboratory. His research interests include the theory and application of methods for learning and planning in stochastic sequential decision problems; algebraic approaches to abstraction; the psychology, neuroscience, and computational theory of motivation, reward, and addiction; and computational models of learning and adaptation in animal motor control systems.

Richard S. Sutton is Professor and iCORE Chair in the Department of Computing Science at the University of Alberta. Dr. Sutton is considered one of the founding fathers of modern computational reinforcement learning, having made several significant contributions to the field, including temporal-difference learning, policy gradient methods, and the Dyna architecture.
