Evaluation of batch-mode reinforcement learning methods for solving DEC-MDPs with changing action sets
Author(s): | Gabel, T.; Riedmiller, M. |
Keywords: | Agents; Algorithms; Education; Learning systems; Reinforcement; Reinforcement learning; Benchmark problems; Iteration algorithms; Learning approaches; Lower complexity; Reinforcement learning (RL) methods; Reinforcement Learning algorithms; Learning algorithms |
Publication date: | 2008 |
Journal: | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Volume: | 5323 LNAI |
Start page: | 82 |
End page: | 95 |
Abstract: | DEC-MDPs with changing action sets and partially ordered transition dependencies have recently been suggested as a sub-class of general DEC-MDPs that features provably lower complexity. In this paper, we investigate the usability of a coordinated batch-mode reinforcement learning algorithm for this class of distributed problems. Our agents acquire their local policies independent of the other agents by repeated interaction with the DEC-MDP and concurrent evolvement of their policies, where the learning approach employed builds upon a specialized variant of a neural fitted Q iteration algorithm, enhanced for use in multi-agent settings. We applied our learning approach to various scheduling benchmark problems and obtained encouraging results that show that problems of current standards of difficulty can very well approximately, and in some cases optimally be solved. © 2008 Springer Berlin Heidelberg. |
Description: | Conference of 8th European Workshop on Reinforcement Learning, EWRL 2008; Conference Date: 30 June 2008 through 3 July 2008; Conference Code: 75100 |
ISBN: | 9783540897217 |
ISSN: | 03029743 |
DOI: | 10.1007/978-3-540-89722-4_7 |
External URL: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-58449107110&doi=10.1007%2f978-3-540-89722-4_7&partnerID=40&md5=13db49efc59e2bbffaa00a6042d09c20 |