Bio-inspired architecture for deriving 3D models from video sequences

Autor(en): Schöning, J. 
Heidemann, G. 
Herausgeber: Ma, K.-K.
Lu, J.
Chen, C.-S.
Stichwörter: Computer vision; Content based retrieval; Object recognition; Video recording; Video signal processing, Bio-inspired architectures; Bio-inspired processing; Human machine interaction; Interactive 3d reconstruction; Multiple view geometry; Pictorial representation; Position information; Prototype implementations, Computer architecture
Erscheinungsdatum: 2017
Herausgeber: Springer Verlag
Journal: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen: 10117 LNCS
Startseite: 62
Seitenende: 76
Zusammenfassung: 
In an everyday context, automatic or interactive 3D reconstruction of objects from one or several videos is not yet possible. Humans, on the contrary, are capable of recognizing the 3D shape of objects even in complex video sequences. To enable machines for doing the same, we propose a bio-inspired processing architecture, which is motivated by the human visual system and converts video data into 3D representations. Similar to the hierarchy of the ventral stream, our process reduces the influence of the position information in the video sequences by object recognition and represents the object of interest as multiple pictorial representations. These multiple pictorial representations are showing 2D projections of the object of interest from different perspectives. Thus, a 3D point cloud can be obtained by multiple view geometry algorithms. In the course of a detailed presentation of this architecture, we additionally highlight existing analogies to the view-combination scheme. The potency of our architecture is shown by reconstructing a car out of two video sequences. In case the automatic processing cannot complete the task, the user is put in the loop to solve the problem interactively. This human-machine interaction facilitates a prototype implementation of the architecture, which can reconstruct 3D objects out of one or several videos. In conclusion, the strengths and limitations of our approach are discussed, followed by an outlook to future work to improve the architecture. © Springer International Publishing AG 2017.
Beschreibung: 
Conference of 13th Asian Conference on Computer Vision, ACCV 2016 ; Conference Date: 20 November 2016 Through 24 November 2016; Conference Code:189839
ISBN: 9783319544267
ISSN: 03029743
DOI: 10.1007/978-3-319-54427-4_5
Externe URL: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85016118480&doi=10.1007%2f978-3-319-54427-4_5&partnerID=40&md5=953d9ffa9b8b168d59f8d6e940858352

Zur Langanzeige

Seitenaufrufe

1
Letzte Woche
0
Letzter Monat
1
geprüft am 18.05.2024

Google ScholarTM

Prüfen

Altmetric