|
|
|
|
|
|
|
|
1. |
Record Nr. |
UNINA9910633938103321 |
|
|
Autore |
Rizvi Syed Ali Asad |
|
|
Titolo |
Output feedback reinforcement learning control for linear systems / / Syed Ali Asad Rizvi, Zongli Lin |
|
|
|
|
|
|
|
Pubbl/distr/stampa |
|
|
Cham, Switzerland : , : Birkhäuser, , [2023] |
|
©2023 |
|
|
|
|
|
|
|
|
|
ISBN |
|
|
|
|
|
|
Descrizione fisica |
|
1 online resource (304 pages) |
|
|
|
|
|
|
Collana |
|
|
|
|
|
|
Disciplina |
|
|
|
|
|
|
Soggetti |
|
Control theory |
Feedback control systems |
Reinforcement learning |
|
|
|
|
|
|
|
|
Lingua di pubblicazione |
|
|
|
|
|
|
Formato |
Materiale a stampa |
|
|
|
|
|
Livello bibliografico |
Monografia |
|
|
|
|
|
Nota di bibliografia |
|
Includes bibliographical references and index. |
|
|
|
|
|
|
Nota di contenuto |
|
Intro -- Preface -- Contents -- Notation and Acronyms -- 1 Introduction to Optimal Control and Reinforcement Learning -- 1.1 Introduction -- 1.2 Optimal Control of Dynamic Systems -- 1.2.1 Dynamic Programming Method -- 1.2.2 The Linear Quadratic Regulation Problem -- 1.2.3 Iterative Numerical Methods -- 1.3 Reinforcement Learning Based Optimal Control -- 1.3.1 Principles of Reinforcement Learning -- 1.3.2 Reinforcement Learning for Automatic Control -- 1.3.3 Advantages of Reinforcement Learning Control -- Optimality and Adaptivity -- Model-Free Control -- Large Spectrum of Applications -- 1.3.4 Limitations of Reinforcement Learning Control -- 1.3.5 Reinforcement Learning Algorithms -- 1.4 Recent Developments and Challenges in Reinforcement Learning Control -- 1.4.1 State Feedback versus Output Feedback Designs -- 1.4.2 Exploration Signal/Noise and Estimation Bias -- 1.4.3 Discounted versus Undiscounted Cost Functions -- 1.4.4 Requirement of a Stabilizing Initial Policy -- 1.4.5 Optimal Tracking Problems -- 1.4.6 Reinforcement Learning in Continuous-Time -- 1.4.7 Disturbance Rejection -- 1.4.8 Distributed Reinforcement Learning -- 1.5 Notes and References -- 2 Model-Free Design of Linear Quadratic Regulator -- 2.1 Introduction -- 2.2 Literature Review -- 2.3 Discrete-Time LQR Problem -- 2.3.1 Iterative Schemes Based on State Feedback -- 2.3.2 |
|
|
|
|