Scopus: 8 cites, Google Scholar: cites,
Detection and Classification of Multiple Objects using an RGB-D Sensor and Linear Spatial Pyramid Matching
Dimitriou, Michalis (Technological Educational Institute of Crete. Applied Informatics and Multimedia Department)
Kounalakis, Tsampikos (Brunel University. Departament of Electronic and Computer Engineering)
Vidakis, Nikolaos (Technological Educational Institute of Crete. Applied Informatics and Multimedia Department)
Triantafyllidis, Georgios (Aalborg University Copenhagen. Medialogy Section)

Data: 2013
Resum: This paper presents a complete system for multiple object detection and classification in a 3D scene using an RGB-D sensor such as the Microsoft Kinect sensor. Successful multiple object detection and classification are crucial features in many 3D computer vision applications. The main goal is making machines see and understand objects like humans do. To this goal, the new RGB-D sensors can be utilized since they provide real-time depth map which can be used along with the RGB images for our tasks. In our system we employ effective depth map processing techniques, along with edge detection, connected components detection and filtering approaches, in order to design a complete image processing algorithm for efficient object detection of multiple individual objects in a single scene, even in complex scenes with many objects. Besides, we apply the Linear Spatial Pyramid Matching (LSPM) [1] method proposed by Jianchao Yang et al for the efficient classification of the detected objects. Experimental results are presented for both detection and classification, showing the efficiency of the proposed design.
Drets: Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial i la comunicació pública de l'obra, sempre que no sigui amb finalitats comercials, i sempre que es reconegui l'autoria de l'obra original. No es permet la creació d'obres derivades. Creative Commons
Llengua: Anglès
Document: Article ; recerca ; Versió publicada
Matèria: Depth Map ; Object Detection ; Microsoft Kinect ; Image Segmentation ; Feature Extraction ; Classification ; Linear Spatial Pyramid Matching
Publicat a: ELCVIA : Electronic Letters on Computer Vision and Image Analysis, Vol. 12, Núm. 2 (2013) , p. 78-87, ISSN 1577-5097

Adreça original: https://elcvia.cvc.uab.es/article/view/v12-n2-triantafyllidis-dimitriou-kounalakis-et-al
Adreça alternativa: https://raco.cat/index.php/ELCVIA/article/view/280910
DOI: 10.5565/rev/elcvia.523


10 p, 3.3 MB

El registre apareix a les col·leccions:
Articles > Articles publicats > ELCVIA
Articles > Articles de recerca

 Registre creat el 2013-11-11, darrera modificació el 2021-12-11



   Favorit i Compartir