Learning policies for abstract state spaces

DC ElementWertSprache
dc.contributor.authorTimmer, S.
dc.contributor.authorRiedmiller, M.
dc.date.accessioned2021-12-23T16:28:05Z-
dc.date.available2021-12-23T16:28:05Z-
dc.date.issued2005
dc.identifier.issn1062922X
dc.identifier.urihttps://osnascholar.ub.uni-osnabrueck.de/handle/unios/15665-
dc.descriptionConference of IEEE Systems, Man and Cybernetics Society, Proceedings - 2005 International Conference on Systems, Man and Cybernetics ; Conference Date: 10 October 2005 Through 12 October 2005; Conference Code:66062
dc.description.abstractApplying Q-Learning to multidimensional, real-valued state spaces is time-consuming in most cases. In this article, we deal with the assumption that a coarse partition of the state space is sufficient for learning good or even optimal policies. An algorithm is presented which constructs proper policies for abstract state spaces using an incremental procedure without approximating a Q-function. By combining an approach similar to dynamic programming and a search for policies, we can speed up the learning process. To provide empirical evidence, we use a cart-pole system. Experiments were conducted for a simulated environment as well as for a real plant. © 2005 IEEE.
dc.description.sponsorshipIEEE Systems, Man and Cybernetics Society
dc.language.isoen
dc.relation.ispartofConference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
dc.subjectCart-pole system
dc.subjectLearning process
dc.subjectQ-function
dc.subjectState space, Approximation theory
dc.subjectDynamic programming
dc.subjectFunctions, Learning systems
dc.titleLearning policies for abstract state spaces
dc.typeconference paper
dc.identifier.scopus2-s2.0-27944452968
dc.identifier.urlhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-27944452968&partnerID=40&md5=e8dad867fe1a18ce4c9bfef988c42210
dc.description.volume4
dc.description.startpage3179
dc.description.endpage3184
dc.publisher.placeWaikoloa, HI
dcterms.isPartOf.abbreviationConf. Proc. IEEE Int. Conf. Syst. Man Cybern.
Zur Kurzanzeige

Seitenaufrufe

3
Letzte Woche
0
Letzter Monat
1
geprüft am 09.06.2024

Google ScholarTM

Prüfen