Evaluation of batch-mode reinforcement learning methods for solving DEC-MDPs with changing action sets

Autor(en): Gabel, T.
Riedmiller, M.
Stichwörter: Agents; Algorithms; Education; Learning systems; Reinforcement; Reinforcement learning, Bench-mark problems; Iteration algorithms; Learning approaches; Lower complexity; Reinforcement learning (RL) methods; Reinforcement Learning algorithms, Learning algorithms
Erscheinungsdatum: 2008
Journal: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen: 5323 LNAI
Startseite: 82
Seitenende: 95
Zusammenfassung: 
DEC-MDPs with changing action sets and partially ordered transition dependencies have recently been suggested as a sub-class of general DEC-MDPs that features provably lower complexity. In this paper, we investigate the usability of a coordinated batch-mode reinforcement learning algorithm for this class of distributed problems. Our agents acquire their local policies independent of the other agents by repeated interaction with the DEC-MDP and concurrent evolvement of their policies, where the learning approach employed builds upon a specialized variant of a neural fitted Q iteration algorithm, enhanced for use in multi-agent settings. We applied our learning approach to various scheduling benchmark problems and obtained encouraging results that show that problems of current standards of difficulty can very well approximately, and in some cases optimally be solved. © 2008 Springer Berlin Heidelberg.
Beschreibung: 
Conference of 8th European Workshop on Reinforcement Learning, EWRL 2008 ; Conference Date: 30 June 2008 Through 3 July 2008; Conference Code:75100
ISBN: 9783540897217
ISSN: 03029743
DOI: 10.1007/978-3-540-89722-4_7
Externe URL: https://www.scopus.com/inward/record.uri?eid=2-s2.0-58449107110&doi=10.1007%2f978-3-540-89722-4_7&partnerID=40&md5=13db49efc59e2bbffaa00a6042d09c20

Zur Langanzeige

Seitenaufrufe

1
Letzte Woche
0
Letzter Monat
0
geprüft am 19.05.2024

Google ScholarTM

Prüfen

Altmetric