A Survey of Domain-Specific Architectures for Reinforcement Learning

DC ElementWertSprache
dc.contributor.authorRothmann, Marc
dc.contributor.authorPorrmann, Mario
dc.date.accessioned2023-02-17T11:32:14Z-
dc.date.available2023-02-17T11:32:14Z-
dc.date.issued2022
dc.identifier.issn2169-3536
dc.identifier.urihttp://osnascholar.ub.uni-osnabrueck.de/handle/unios/65302-
dc.description.abstractReinforcement learning algorithms have been very successful at solving sequential decision-making problems in many different problem domains. However, their training is often time-consuming, with training times ranging from multiple hours to weeks. The development of domain-specific architectures for reinforcement learning promises faster computation times, decreased experiment turn-around time, and improved energy efficiency. This paper presents a review of hardware architectures for the acceleration of reinforcement learning algorithms. FPGA-based implementations are the focus of this work, but GPU-based approaches are considered as well. Both tabular and deep reinforcement learning algorithms are included in this survey. The techniques employed in different implementations are highlighted and compared. Finally, possible areas for future work are suggested, based on the preceding discussion of existing architectures.
dc.description.sponsorshipEuropean Union's Horizon 2020 Research and Innovation Program through the Very Efficient Deep Learning in IoT (VEDLIoT) Project [957197]; This work was supported by the European Union's Horizon 2020 Research and Innovation Program through the Very Efficient Deep Learning in IoT (VEDLIoT) Project under Grant 957197.
dc.language.isoen
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
dc.relation.ispartofIEEE ACCESS
dc.subjectComputer architecture
dc.subjectComputer Science
dc.subjectComputer Science, Information Systems
dc.subjectdeep learning
dc.subjectdeep reinforcement learning
dc.subjectDomain-specific architectures
dc.subjectEngineering
dc.subjectEngineering, Electrical & Electronic
dc.subjectFPGA
dc.subjectGraphics processing units
dc.subjectIMPLEMENTATION
dc.subjectLEVEL
dc.subjectmachine learning
dc.subjectNeural networks
dc.subjectOptimization
dc.subjectQ-learning
dc.subjectreconfigurable architectures
dc.subjectreinforcement learning
dc.subjectTelecommunications
dc.subjectTraining
dc.titleA Survey of Domain-Specific Architectures for Reinforcement Learning
dc.typejournal article
dc.identifier.doi10.1109/ACCESS.2022.3146518
dc.identifier.isiISI:000753402400001
dc.description.volume10
dc.description.startpage13753
dc.description.endpage13767
dc.contributor.orcid0000-0003-2886-8197
dc.publisher.place445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA
dcterms.isPartOf.abbreviationIEEE Access
dcterms.oaStatusgold
local.import.remainsaffiliations : University Osnabruck
local.import.remainsweb-of-science-index : Science Citation Index Expanded (SCI-EXPANDED)
crisitem.author.deptFB 06 - Mathematik/Informatik-
crisitem.author.deptidfb06-
crisitem.author.orcid0000-0003-1005-5753-
crisitem.author.parentorgUniversität Osnabrück-
crisitem.author.netidPoMa309-
Zur Kurzanzeige

Seitenaufrufe

8
Letzte Woche
0
Letzter Monat
2
geprüft am 23.05.2024

Google ScholarTM

Prüfen

Altmetric