FAQ: A Flexible Accelerator for Q-Learning with Configurable Environment

DC ElementWertSprache
dc.contributor.authorRothmann, M.
dc.contributor.authorPorrmann, M.
dc.contributor.editorPericas, M.
dc.contributor.editorPnevmatikatos, D.N.
dc.contributor.editorTrancoso, P.P.M.
dc.contributor.editorSourdis, I.
dc.date.accessioned2023-02-17T12:15:14Z-
dc.date.available2023-02-17T12:15:14Z-
dc.date.issued2022
dc.identifier.isbn9781665483087
dc.identifier.issn1063-6862
dc.identifier.urihttp://osnascholar.ub.uni-osnabrueck.de/handle/unios/65890-
dc.descriptionConference of 33rd IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2022 ; Conference Date: 12 July 2022 Through 14 July 2022; Conference Code:183622
dc.description.abstractReinforcement Learning is an area of machine learning that is concerned with optimizing the behavior of an agent in an environment by maximizing cumulative rewards. This can be done with classical reinforcement learning algorithms such as Q-Learning and SARSA. This paper presents FAQ, a flexible FPGA-based accelerator for the Q-Learning algorithm. The architecture of the accelerator can be configured in multiple ways, like adjusting the bit width of Q-values or changing the number of pipeline stages. The evaluation shows that FAQ achieves 249% higher throughput than state-of-the-art FPGA implementations while decreasing DSP and BRAM utilization. Additionally, a software-configurable environment was implemented, and the whole system was tested on an Ultra96-V2 development board utilizing the PYNQ framework. Compared to a CPU implementation, FAQ is more than 13 times faster, including communication overhead caused by transferring the environment onto the FPGA and reading the resulting Q-table. © 2022 IEEE.
dc.description.sponsorship957197; This publication incorporates results from the VEDLIoT project, which received funding from the European Union s Horizon 2020 research and innovation programme under grant agreement No 957197; Gaisler
dc.language.isoen
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.relation.ispartofProceedings of the International Conference on Application-Specific Systems, Architectures and Processors
dc.subjectDomain specific architectures
dc.subjectDomain-specific architectures
dc.subjectField programmable gate arrays (FPGA)
dc.subjectHigh-throughput
dc.subjectLearning algorithms
dc.subjectMachine Learning
dc.subjectMachine-learning
dc.subjectNASA
dc.subjectPipeline stages
dc.subjectQ-learning
dc.subjectQ-learning algorithms
dc.subjectQ-values
dc.subjectReconfigurable architectures
dc.subjectReconfigurable hardware
dc.subjectReinforcement Learning
dc.subjectReinforcement learning algorithms
dc.subjectReinforcement learnings, Reinforcement learning
dc.subjectSoftware testing, Bit-Width
dc.titleFAQ: A Flexible Accelerator for Q-Learning with Configurable Environment
dc.typeconference paper
dc.identifier.doi10.1109/ASAP54787.2022.00026
dc.identifier.scopus2-s2.0-85140964922
dc.identifier.urlhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85140964922&doi=10.1109%2fASAP54787.2022.00026&partnerID=40&md5=d0d93433951d00e19634419bf07a4e81
dc.description.volume2022-July
dc.description.startpage106
dc.description.endpage114
dcterms.isPartOf.abbreviationProc Int Conf Appl Spec Syst Arcitec Process Proc
crisitem.author.deptFB 06 - Mathematik/Informatik-
crisitem.author.deptidfb06-
crisitem.author.orcid0000-0003-1005-5753-
crisitem.author.parentorgUniversität Osnabrück-
crisitem.author.netidPoMa309-
Zur Kurzanzeige

Seitenaufrufe

4
Letzte Woche
0
Letzter Monat
0
geprüft am 23.05.2024

Google ScholarTM

Prüfen

Altmetric