03099oam 2200637I 450 991045905020332120200520144314.01-315-21793-71-282-90296-297866129029631-4398-2109-710.1201/9781439821091 (CKB)2670000000047153(EBL)589872(OCoLC)666378166(SSID)ssj0000426925(PQKBManifestationID)11302179(PQKBTitleCode)TC0000426925(PQKBWorkID)10390207(PQKB)10539407(MiAaPQ)EBC589872(PPN)168822601(Au-PeEL)EBL589872(CaPaEBR)ebr10419897(CaONFJC)MIL290296(EXLCZ)99267000000004715320180331d2010 uy 0engur|n|---|||||txtccrReinforcement learning and dynamic programming using function approximators // Lucian Busoniu. [et al]Boca Raton :CRC Press,2010.1 online resource (285 p.)Automation and control engineeringDescription based upon print version of record.1-4398-2108-9 Includes bibliographical references and index.Cover; Title; Copyright; Preface; About the authors; Contents; 1 Introduction; 2 An introduction to dynamic programming and reinforcement learning; 3 Dynamic programming and reinforcement learning in large and continuous spaces; 4 Approximate value iteration with a fuzzy representation; 5 Approximate policy iteration for online learning and continuous-action control; 6 Approximate policy search with cross-entropy optimization of basis functions; Appendix A: Extremely randomized trees; Appendix B: The cross-entropy method; Symbols and abbreviations; Bibliography; List of algorithms; IndexFrom household appliances to applications in robotics, engineered systems involving complex dynamics can only be as effective as the algorithms that control them. While Dynamic Programming (DP) has provided researchers with a way to optimally solve decision and control problems involving complex dynamic systems, its practical value was limited by algorithms that lacked the capacity to scale up to realistic problems. However, in recent years, dramatic developments in Reinforcement Learning (RL), the model-free counterpart of DP, changed our understanding of what is possible. Those devAutomation and control engineering.Digital control systemsDynamic programmingElectronic books.Digital control systems.Dynamic programming.629.8/9Busoniu Lucian989644MiAaPQMiAaPQMiAaPQBOOK9910459050203321Reinforcement learning and dynamic programming using function approximators2263490UNINA