LEADER 02759nam 2200637 a 450 001 9911019143403321 005 20200520144314.0 010 $a1-299-18985-7 010 $a1-118-45393-X 024 7 $a10.1002/9781118453988 035 $a(CKB)2670000000327913 035 $a(EBL)947728 035 $a(SSID)ssj0000826853 035 $a(PQKBManifestationID)11434333 035 $a(PQKBTitleCode)TC0000826853 035 $a(PQKBWorkID)10808932 035 $a(PQKB)11010193 035 $a(MiAaPQ)EBC947728 035 $a(CaBNVSL)mat06462203 035 $a(IDAMS)0b00006481cd5eff 035 $a(IEEE)6462203 035 $a(OCoLC)798809964 035 $a(PPN)257508333 035 $a(EXLCZ)992670000000327913 100 $a20120706d2013 uy 0 101 0 $aeng 135 $aur|n|---||||| 181 $ctxt 182 $cc 183 $acr 200 00$aReinforcement learning and approximate dynamic programming for feedback control /$fedited by Frank L. Lewis, Derong Liu 210 $aHoboken, N.J. $cIEEE/John Wiley and Sons, Inc.$d2013 215 $a1 online resource (643 p.) 225 1 $aIEEE Press series on computational intelligence 300 $aDescription based upon print version of record. 311 $a1-118-45398-0 311 $a1-118-10420-X 320 $aIncludes bibliographical references and index. 327 $apt. 1. Feedback control using RL and ADP -- pt. 2. Learning and control in multiagent games -- pt. 3. Foundations in MDP and RL. 330 $a"Reinforcement learning (RL) and adaptive dynamic programming (ADP) has been one of the most critical research fields in science and engineering for modern complex systems. This book describes the latest RL and ADP techniques for decision and control in human engineered systems, covering both single player decision and control and multi-player games. Edited by the pioneers of RL and ADP research, the book brings together ideas and methods from many fields and provides an important and timely guidance on controlling a wide variety of systems, such as robots, industrial processes, and economic decision-making"--$cProvided by publisher. 410 0$aIEEE series on computational intelligence. 606 $aReinforcement learning 606 $aFeedback control systems 615 0$aReinforcement learning. 615 0$aFeedback control systems. 676 $a003/.5 686 $aTEC008000$2bisacsh 701 $aLewis$b Frank L$030830 701 $aLiu$b Derong$f1963-$066913 801 0$bMiAaPQ 801 1$bMiAaPQ 801 2$bMiAaPQ 906 $aBOOK 912 $a9911019143403321 996 $aReinforcement learning and approximate dynamic programming for feedback control$94420481 997 $aUNINA