Biologically Inspired Deep Learning Model for Efficient Foveal-Peripheral Vision

DC ElementWertSprache
dc.contributor.authorLukanov, Hristofor
dc.contributor.authorKoenig, Peter
dc.contributor.authorPipa, Gordon
dc.date.accessioned2021-12-23T16:12:29Z-
dc.date.available2021-12-23T16:12:29Z-
dc.date.issued2021
dc.identifier.urihttps://osnascholar.ub.uni-osnabrueck.de/handle/unios/10240-
dc.description.abstractWhile abundant in biology, foveated vision is nearly absent from computational models and especially deep learning architectures. Despite considerable hardware improvements, training deep neural networks still presents a challenge and constraints complexity of models. Here we propose an end-to-end neural model for foveal-peripheral vision, inspired by retino-cortical mapping in primates and humans. Our model has an efficient sampling technique for compressing the visual signal such that a small portion of the scene is perceived in high resolution while a large field of view is maintained in low resolution. An attention mechanism for performing ``eye-movements'' assists the agent in collecting detailed information incrementally from the observed scene. Our model achieves comparable results to a similar neural architecture trained on full-resolution data for image classification and outperforms it at video classification tasks. At the same time, because of the smaller size of its input, it can reduce computational effort tenfold and uses several times less memory. Moreover, we present an easy to implement bottom-up and top-down attention mechanism which relies on task-relevant features and is therefore a convenient byproduct of the main architecture. Apart from its computational efficiency, the presented work provides means for exploring active vision for agent training in simulated environments and anthropomorphic robotics.
dc.description.sponsorshipDeutsche Forschungsgemeinschaft (DFG), GermanyGerman Research Foundation (DFG) [GRK2340]; The project was financed by the funds of a research training group ``Computational Cognition'' (GRK2340) provided by the Deutsche Forschungsgemeinschaft (DFG), Germany.
dc.language.isoen
dc.publisherFRONTIERS MEDIA SA
dc.relation.ispartofFRONTIERS IN COMPUTATIONAL NEUROSCIENCE
dc.subjectactive vision
dc.subjectATTENTION
dc.subjectbottom-up attention
dc.subjectdeep learning-artificial neural network (DL-ANN)
dc.subjectFIELD
dc.subjectfoveal vision
dc.subjectGANGLION-CELLS
dc.subjectMathematical & Computational Biology
dc.subjectNEURAL-NETWORK
dc.subjectNeurosciences
dc.subjectNeurosciences & Neurology
dc.subjectperipheral vision
dc.subjectspace-variant vision
dc.subjecttop-down attention
dc.titleBiologically Inspired Deep Learning Model for Efficient Foveal-Peripheral Vision
dc.typejournal article
dc.identifier.doi10.3389/fncom.2021.746204
dc.identifier.isiISI:000727185900001
dc.description.volume15
dc.identifier.eissn16625188
dc.publisher.placeAVENUE DU TRIBUNAL FEDERAL 34, LAUSANNE, CH-1015, SWITZERLAND
dcterms.isPartOf.abbreviationFront. Comput. Neurosci.
dcterms.oaStatusGreen Published, gold
crisitem.author.deptInstitut für Kognitionswissenschaft-
crisitem.author.deptFB 05 - Biologie/Chemie-
crisitem.author.deptInstitut für Kognitionswissenschaft-
crisitem.author.deptidinstitute28-
crisitem.author.deptidfb05-
crisitem.author.deptidinstitute28-
crisitem.author.orcid0000-0003-3654-5267-
crisitem.author.orcid0000-0002-3416-2652-
crisitem.author.parentorgFB 08 - Humanwissenschaften-
crisitem.author.parentorgUniversität Osnabrück-
crisitem.author.parentorgFB 08 - Humanwissenschaften-
crisitem.author.grandparentorgUniversität Osnabrück-
crisitem.author.grandparentorgUniversität Osnabrück-
crisitem.author.netidKoPe298-
crisitem.author.netidPiGo340-
Zur Kurzanzeige

Google ScholarTM

Prüfen

Altmetric