Scopus: 2 cites, Google Scholar: cites
Multimodal stereo from thermal infrared and visible spectrum
Barrera, Fernando

Data: 2014
Resum: Recent advances in thermal infrared imaging (LWIR) has allowed its use in applications beyond of the military domain. Nowadays, this new family of sensors is included in different technical and scientific applications. They offer features that facilitate tasks, such as detection of pedestrians, hot spots, differences in temperature, among others, which can significantly improve the performance of a system where the persons are expected to play the principal role. For instance, video surveillance applications, monitoring, and pedestrian detection. During the dissertation the next question is stated: \textit{Could a couple of sensors measuring different bands of the electromagnetic spectrum, as the visible and thermal infrared, be used to extract depth information?} Although it is a complex question, we shows that a system of these characteristics is possible as well as their advantages, drawbacks, and potential opportunities. In this research an experimental study that compares different cost functions and matching approaches is performed, in order to build a multimodal stereovision system. Furthermore, the common problems in infrared/visible stereo, specially in the outdoor scenes are identified. Our framework summarizes the architecture of a generic stereo algorithm, at different levels: computational, functional, and structural, which can be extended toward high-level fusion (semantic) and high-order (prior). The proposed framework is intended to explore novel multimodal stereo matching approaches, going from sparse to dense representations (both disparity and depth maps). Moreover, context information is added in form of priors and assumptions. Finally, the dissertation shows a promissory way toward the integration of multiple sensors for recovering three-dimensional information.
Nota: Advisors: Angel Sappa, Felipe Lumbreras. Date and location of PhD thesis defense: 29 November 2012, Universitat Autònoma de Barcelona
Drets: Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial i la comunicació pública de l'obra, sempre que no sigui amb finalitats comercials, i sempre que es reconegui l'autoria de l'obra original. No es permet la creació d'obres derivades. Creative Commons
Llengua: Anglès
Document: Altres ; recerca ; Versió publicada
Matèria: Computer vision ; Sensor systems ; 3D and Stereo
Publicat a: ELCVIA : Electronic Letters on Computer Vision and Image Analysis, Vol. 13, Núm. 2 (2014) , p. 63-64, ISSN 1577-5097

Adreça original: https://elcvia.cvc.uab.es/article/view/v13-n3-barrera
Adreça alternativa: https://raco.cat/index.php/ELCVIA/article/view/281640
DOI: 10.5565/rev/elcvia.619


2 p, 55.1 KB

El registre apareix a les col·leccions:
Articles > Articles publicats > ELCVIA
Articles > Articles de recerca

 Registre creat el 2014-07-29, darrera modificació el 2024-02-23



   Favorit i Compartir