Ebook: Reinforcement Learning

Author: Richard S. Sutton (auth.) Richard S. Sutton (eds.)

Tags: Artificial Intelligence (incl. Robotics), Statistical Physics Dynamical Systems and Complexity
Series: The Springer International Series in Engineering and Computer Science 173
Year: 1992
Publisher: Springer US
Edition: 1
Language: English
pdf

00

27.01.2024

0

Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a scalar reward or reinforcement signal. The learner is not told which action to take, as in most forms of machine learning, but instead must discover which actions yield the highest reward by trying them. In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the next situation, and through that all subsequent rewards. These two characteristics -- trial-and-error search and delayed reward -- are the most important distinguishing features of reinforcement learning.
Reinforcement learning is both a new and a very old topic in AI. The term appears to have been coined by Minsk (1961), and independently in control theory by Walz and Fu (1965). The earliest machine learning research now viewed as directly relevant was Samuel's (1959) checker player, which used temporal-difference learning to manage delayed reward much as it is used today. Of course learning and reinforcement have been studied in psychology for almost a century, and that work has had a very strong impact on the AI/engineering work. One could in fact consider all of reinforcement learning to be simply the reverse engineering of certain psychological learning processes (e.g. operant conditioning and secondary reinforcement).
Reinforcement Learning is an edited volume of original research, comprising seven invited contributions by leading researchers.

Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a scalar reward or reinforcement signal. The learner is not told which action to take, as in most forms of machine learning, but instead must discover which actions yield the highest reward by trying them. In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the next situation, and through that all subsequent rewards. These two characteristics -- trial-and-error search and delayed reward -- are the most important distinguishing features of reinforcement learning.
Reinforcement learning is both a new and a very old topic in AI. The term appears to have been coined by Minsk (1961), and independently in control theory by Walz and Fu (1965). The earliest machine learning research now viewed as directly relevant was Samuel's (1959) checker player, which used temporal-difference learning to manage delayed reward much as it is used today. Of course learning and reinforcement have been studied in psychology for almost a century, and that work has had a very strong impact on the AI/engineering work. One could in fact consider all of reinforcement learning to be simply the reverse engineering of certain psychological learning processes (e.g. operant conditioning and secondary reinforcement).
Reinforcement Learning is an edited volume of original research, comprising seven invited contributions by leading researchers.

Content:
Front Matter....Pages iii-vi
Introduction: The Challenge of Reinforcement Learning....Pages 1-3
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning....Pages 5-32
Practical Issues in Temporal Difference Learning....Pages 33-53
Technical Note....Pages 55-68
Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching....Pages 69-97
Transfer of Learning by Composing Solutions of Elemental Sequential Tasks....Pages 99-115
The Convergence of TD(?) for General ?....Pages 117-138
A Reinforcement Connectionist Approach to Robot Path Finding in Non-Maze-Like Environments....Pages 139-171
Back Matter....Pages 172-172

Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a scalar reward or reinforcement signal. The learner is not told which action to take, as in most forms of machine learning, but instead must discover which actions yield the highest reward by trying them. In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the next situation, and through that all subsequent rewards. These two characteristics -- trial-and-error search and delayed reward -- are the most important distinguishing features of reinforcement learning.
Reinforcement learning is both a new and a very old topic in AI. The term appears to have been coined by Minsk (1961), and independently in control theory by Walz and Fu (1965). The earliest machine learning research now viewed as directly relevant was Samuel's (1959) checker player, which used temporal-difference learning to manage delayed reward much as it is used today. Of course learning and reinforcement have been studied in psychology for almost a century, and that work has had a very strong impact on the AI/engineering work. One could in fact consider all of reinforcement learning to be simply the reverse engineering of certain psychological learning processes (e.g. operant conditioning and secondary reinforcement).
Reinforcement Learning is an edited volume of original research, comprising seven invited contributions by leading researchers.

Content:
Front Matter....Pages iii-vi
Introduction: The Challenge of Reinforcement Learning....Pages 1-3
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning....Pages 5-32
Practical Issues in Temporal Difference Learning....Pages 33-53
Technical Note....Pages 55-68
Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching....Pages 69-97
Transfer of Learning by Composing Solutions of Elemental Sequential Tasks....Pages 99-115
The Convergence of TD(?) for General ?....Pages 117-138
A Reinforcement Connectionist Approach to Robot Path Finding in Non-Maze-Like Environments....Pages 139-171
Back Matter....Pages 172-172
....

Download the book Reinforcement Learning for free or read online

Read Download

Problems?

Download pdf book

Continue reading on any device:
QR code

Last viewed books

Related books