LEADER 05186nam 22007935 450 001 996466337803316 005 20200721090208.0 010 $a3-540-89722-4 024 7 $a10.1007/978-3-540-89722-4 035 $a(CKB)1000000000545882 035 $a(SSID)ssj0000319771 035 $a(PQKBManifestationID)11247499 035 $a(PQKBTitleCode)TC0000319771 035 $a(PQKBWorkID)10338603 035 $a(PQKB)11381856 035 $a(DE-He213)978-3-540-89722-4 035 $a(MiAaPQ)EBC3063770 035 $a(PPN)132861577 035 $a(EXLCZ)991000000000545882 100 $a20100301d2008 u| 0 101 0 $aeng 135 $aurnn#008mamaa 181 $ctxt 182 $cc 183 $acr 200 10$aRecent Advances in Reinforcement Learning$b[electronic resource] $e8th European Workshop, EWRL 2008, Villeneuve d'Ascq, France, June 30-July 3, 2008, Revised and Selected Papers /$fedited by Sertan Girgin, Manuel Loth, Rémi Munos, Philippe Preux, Daniil Ryabko 205 $a1st ed. 2008. 210 1$aBerlin, Heidelberg :$cSpringer Berlin Heidelberg :$cImprint: Springer,$d2008. 215 $a1 online resource (XII, 283 p.) 225 1 $aLecture Notes in Artificial Intelligence ;$v5323 300 $aInternational conference proceedings. 311 $a3-540-89721-6 320 $aIncludes bibliographical references and index. 327 $aLazy Planning under Uncertainty by Optimizing Decisions on an Ensemble of Incomplete Disturbance Trees -- Exploiting Additive Structure in Factored MDPs for Reinforcement Learning -- Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration -- Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case -- Regularized Fitted Q-Iteration: Application to Planning -- A Near Optimal Policy for Channel Allocation in Cognitive Radio -- Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets -- Bayesian Reward Filtering -- Basis Expansion in Natural Actor Critic Methods -- Reinforcement Learning with the Use of Costly Features -- Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem -- Optimistic Planning of Deterministic Systems -- Policy Iteration for Learning an Exercise Policy for American Options -- Tile Coding Based on Hyperplane Tiles -- Use of Reinforcement Learning in Two Real Applications -- Applications of Reinforcement Learning to Structured Prediction -- Policy Learning ? A Unified Perspective with Applications in Robotics -- Probabilistic Inference for Fast Learning in Control -- United We Stand: Population Based Methods for Solving Unknown POMDPs -- New Error Bounds for Approximations from Projected Linear Equations -- Markov Decision Processes with Arbitrary Reward Processes. 330 $aThis book constitutes revised and selected papers of the 8th European Workshop on Reinforcement Learning, EWRL 2008, which took place in Villeneuve d'Ascq, France, during June 30 - July 3, 2008. The 21 papers presented were carefully reviewed and selected from 61 submissions. They are dedicated to the field of and current researches in reinforcement learning. 410 0$aLecture Notes in Artificial Intelligence ;$v5323 606 $aArtificial intelligence 606 $aComputer programming 606 $aComputers 606 $aApplication software 606 $aDatabase management 606 $aArtificial Intelligence$3https://scigraph.springernature.com/ontologies/product-market-codes/I21000 606 $aProgramming Techniques$3https://scigraph.springernature.com/ontologies/product-market-codes/I14010 606 $aTheory of Computation$3https://scigraph.springernature.com/ontologies/product-market-codes/I16005 606 $aComputation by Abstract Devices$3https://scigraph.springernature.com/ontologies/product-market-codes/I16013 606 $aInformation Systems Applications (incl. Internet)$3https://scigraph.springernature.com/ontologies/product-market-codes/I18040 606 $aDatabase Management$3https://scigraph.springernature.com/ontologies/product-market-codes/I18024 615 0$aArtificial intelligence. 615 0$aComputer programming. 615 0$aComputers. 615 0$aApplication software. 615 0$aDatabase management. 615 14$aArtificial Intelligence. 615 24$aProgramming Techniques. 615 24$aTheory of Computation. 615 24$aComputation by Abstract Devices. 615 24$aInformation Systems Applications (incl. Internet). 615 24$aDatabase Management. 676 $a006.3/1 702 $aGirgin$b Sertan$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aLoth$b Manuel$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aMunos$b Rémi$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aPreux$b Philippe$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aRyabko$b Daniil$4edt$4http://id.loc.gov/vocabulary/relators/edt 712 12$aEWRL 2008 906 $aBOOK 912 $a996466337803316 996 $aRecent Advances in Reinforcement Learning$9774130 997 $aUNISA