Reinforcement Learning And Dynamic Programming Using Function Approximators (automation And Control Engineering)

E-Book Overview

From household appliances to applications in robotics, engineered systems involving complex dynamics can only be as effective as the algorithms that control them. While Dynamic Programming (DP) has provided researchers with a way to optimally solve decision and control problems involving complex dynamic systems, its practical value was limited by algorithms that lacked the capacity to scale up to realistic problems.  However, in recent years, dramatic developments in Reinforcement Learning (RL), the model-free counterpart of DP, changed our understanding of what is possible. Those developments led to the creation of reliable methods that can be applied even when a mathematical model of the system is unavailable, allowing researchers to solve challenging control problems in engineering, as well as in a variety of other disciplines, including economics, medicine, and artificial intelligence. Reinforcement Learning and Dynamic Programming Using Function Approximators provides a comprehensive and unparalleled exploration of the field of RL and DP. With a focus on continuous-variable problems, this seminal text details essential developments that have substantially altered the field over the past decade. In its pages, pioneering experts provide a concise introduction to classical RL and DP, followed by an extensive presentation of the state-of-the-art and novel methods in RL and DP with approximation. Combining algorithm development with theoretical guarantees, they elaborate on their work with illustrative examples and insightful comparisons. Three individual chapters are dedicated to representative algorithms from each of the major classes of techniques: value iteration, policy iteration, and policy search. The features and performance of these algorithms are highlighted in extensive experimental studies on a range of control applications. The recent development of applications involving complex systems has led to a surge of interest in RL and DP methods and the subsequent need for a quality resource on the subject. For graduate students and others new to the field, this book offers a thorough introduction to both the basics and emerging methods. And for those researchers and practitioners working in the fields of optimal and adaptive control, machine learning, artificial intelligence, and operations research, this resource offers a combination of practical algorithms, theoretical analysis, and comprehensive examples that they will be able to adapt and apply to their own work. Access the authors' website at www.dcsc.tudelft.nl/rlbook/ for additional material, including computer code used in the studies and information concerning new developments.

E-Book Content

Reinforcement Learning and Dynamic Programming Using Function Approximators K11117_FM.indd 1 3/22/10 4:10:24 AM AUTOMATION AND CONTROL ENGINEERING A Series of Reference Books and Textbooks Series Editors FRANK L. LEWIS, Ph.D., Fellow IEEE, Fellow IFAC Professor Automation and Robotics Research Institute The University of Texas at Arlington SHUZHI SAM GE, Ph.D., Fellow IEEE Professor Interactive Digital Media Institute The National University of Singapore Reinforcement Learning and Dynamic Programming Using Function Approximators, Lucian Bus¸oniu, Robert Babuška, Bart De Schutter, and Damien Ernst Modeling and Control of Vibration in Mechanical Systems, Chunling Du and Lihua Xie Analysis and Synthesis of Fuzzy Control Systems: A Model-Based Approach, Gang Feng Lyapunov-Based Control of Robotic Systems, Aman Behal, Warren Dixon, Darren M. Dawson, and Bin Xian System Modeling and Control with Resource-Oriented Petri Nets, Naiqi Wu and MengChu Zhou Sliding Mode Control in Electro-Mechanical Systems, Second Edition, Vadim Utkin, Jürgen Guldner, and Jingxin Shi Optimal Control: Weakly Coupled Systems and Applications, Zo
You might also like

Introduction To Parallel Computing: [a Practical Guide With Examples In C]
Authors: W. P. Petersen , P. Arbenz    84    0



Pattern Classification
Authors: Richard O. Duda , Peter E. Hart , David G. Stork    122    0


Principles Of Constraint Programming
Authors: Krzysztof Apt    107    0


современный фортран
Authors: Бартеньев О.    189    0


Encyclopedia Of Physical Science And Technology - Computer Software
Authors: Robert A. Meyers (Editor-in-Chief)    144    0


Webster's New World Telecom Dictionary
Authors: Ray Horak    129    0


Algorithmic Game Theory
Authors: Noam Nisan , Tim Roughgarden , Eva Tardos , Vijay V. Vazirani    138    0



Tex For The Impatient
Authors: Abrahams P.W. , Hargreaves K.A. , Berry K.    118    0