1. |
Record No. |
UNINA9910824178803321 |
|
|
Author |
Schwartz, Howard M. |
|
|
Title |
Multi-agent machine learning : a reinforcement approach / Howard M. Schwartz |
|
|
|
|
|
|
|
Publication/distribution/printing |
|
|
Hoboken, New Jersey : John Wiley & Sons, Inc., 2014 |
|
©2014 |
|
|
|
|
|
|
|
|
|
ISBN |
|
1-118-88448-5 |
1-118-88461-2 |
1-118-88447-7 |
|
|
|
|
|
|
|
|
Edition |
[1st edition] |
|
|
|
|
|
Physical description |
|
1 online resource (458 p.) |
|
|
|
|
|
|
Classification |
|
|
|
|
|
|
Discipline |
|
|
|
|
|
|
Subjects |
|
Reinforcement learning |
Differential games |
Swarm intelligence |
Machine learning |
|
|
|
|
|
|
|
|
Language of publication |
|
|
|
|
|
|
Format |
Printed material |
|
|
|
|
|
Bibliographic level |
Monograph |
|
|
|
|
|
General notes |
|
Description based upon print version of record. |
|
|
|
|
|
|
Bibliography note |
|
Includes bibliographical references at the end of each chapter and an index. |
|
|
|
|
|
|
|
|
Contents note |
|
Cover; Title Page; Copyright; Preface; References; Chapter 1: A Brief Review of Supervised Learning; 1.1 Least Squares Estimates; 1.2 Recursive Least Squares; 1.3 Least Mean Squares; 1.4 Stochastic Approximation; References; Chapter 2: Single-Agent Reinforcement Learning; 2.1 Introduction; 2.2 n-Armed Bandit Problem; 2.3 The Learning Structure; 2.4 The Value Function; 2.5 The Optimal Value Functions; 2.6 Markov Decision Processes; 2.7 Learning Value Functions; 2.8 Policy Iteration; 2.9 Temporal Difference Learning; 2.10 TD Learning of the State-Action Function; 2.11 Q-Learning; 2.12 Eligibility Traces; References; Chapter 3: Learning in Two-Player Matrix Games; 3.1 Matrix Games; 3.2 Nash Equilibria in Two-Player Matrix Games; 3.3 Linear Programming in Two-Player Zero-Sum Matrix Games; 3.4 The Learning Algorithms; 3.5 Gradient Ascent Algorithm; 3.6 WoLF-IGA Algorithm; 3.7 Policy Hill Climbing (PHC); 3.8 WoLF-PHC Algorithm; 3.9 Decentralized Learning in Matrix Games; 3.10 Learning Automata; 3.11 Linear Reward-Inaction Algorithm; 3.12 Linear Reward- |