Selection of relevant information to improve Image Classification using Bag of Visual Words

Fidalgo Fernández, Eduardo

doi:10.5565/rev/elcvia.1102

Permanent link: https://ddd.uab.cat/record/188742


Alegre Gutiérrez, Enrique dir. (Universidad de León. Departamento de Ingeniería Eléctrica y de Sistemas y Automática)
González-Castro, Víctor dir. (Universidad de León. Departamento de Ingeniería Eléctrica y de Sistemas y Automática)

Date:	2017
Abstract:	One of the main challenges in computer vision is image classification. Nowadays the number of images increases exponentially every day; therefore, it is important to classify them reliably. The conventional image classification pipeline usually consists of extracting local image features, encoding them as a feature vector and classifying them using a previously created model. With regard to feature encoding, the Bag of Words model and its extensions, such as pyramid matching and weighted schemes, have achieved quite good results and have become state-of-the-art methods. This process is not perfect, and computers, like humans, may make mistakes in any of the steps, causing a drop in classification performance. Some of the primary sources of error in large-scale image classification are the presence of multiple objects in the image, small or very thin objects, incorrect annotations and fine-grained recognition tasks, among others. Based on those problems and on the steps of a typical image classification pipeline, the motivation of this PhD thesis was to provide guidelines for improving the quality of the extracted features in order to obtain better classification results. The contributions of the thesis demonstrate that good feature selection can improve fine-grained classification, and that a large training data set may not even be needed to learn the key features of each class and to predict with good results.
Rights:	This document is subject to a Creative Commons license. Total or partial reproduction, distribution, and public communication of the work are permitted, provided that they are not for commercial purposes and that the authorship of the original work is acknowledged. The creation of derivative works is not permitted.
Language:	English
Document:	Other ; research ; Published version
Subject:	Computer vision ; Features and image descriptors ; Object description and recognition ; Support vector machines and kernels ; Image analysis and processing ; Shape extraction and representation
Published in:	ELCVIA, Vol. 16, Num. 2 (2017), p. 5-8 (Special Issue on Recent PhD Thesis Dissemination (2017)), ISSN 1577-5097
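
The pipeline outlined in the abstract (local features, Bag of Visual Words encoding, classification with a previously created model) can be illustrated with a minimal Python sketch. It is not the thesis's implementation. It assumes the local descriptors have already been extracted as numeric vectors, builds the vocabulary with plain k-means, and swaps in a nearest-class-mean classifier for brevity, although the subject keywords of this record mention support vector machines. All function names are hypothetical.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(vec, centers):
    """Index of the center closest to vec."""
    return min(range(len(centers)), key=lambda i: sq_dist(vec, centers[i]))

def build_vocabulary(descriptors, k, iterations=10, seed=0):
    """Cluster local descriptors into k visual words (Lloyd's k-means)."""
    rng = random.Random(seed)
    centers = [list(d) for d in rng.sample(list(descriptors), k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for d in descriptors:
            clusters[nearest(d, centers)].append(d)
        for i, members in enumerate(clusters):
            if members:  # an empty cluster keeps its previous center
                centers[i] = [sum(col) / len(members) for col in zip(*members)]
    return centers

def encode(descriptors, vocabulary):
    """Encode one image as an L1-normalized histogram of visual words."""
    hist = [0.0] * len(vocabulary)
    for d in descriptors:
        hist[nearest(d, vocabulary)] += 1.0
    total = sum(hist)
    return [h / total for h in hist] if total else hist

def train_class_means(histograms, labels):
    """Stand-in model (assumption): the mean histogram of each class."""
    by_label = {}
    for h, y in zip(histograms, labels):
        by_label.setdefault(y, []).append(h)
    return {y: [sum(col) / len(hs) for col in zip(*hs)]
            for y, hs in by_label.items()}

def predict(hist, model):
    """Return the label whose mean histogram is closest to hist."""
    return min(model, key=lambda y: sq_dist(hist, model[y]))
```

In this sketch, selecting relevant information would amount to filtering the descriptors passed to `build_vocabulary` and `encode`. The specific selection criteria belong to the thesis and are not reproduced here.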

Original address: https://elcvia.cvc.uab.es/article/view/v16-n2-fidalgo
Alternative address: https://raco.cat/index.php/ELCVIA/article/view/v16-n2-fidalgo
Original address: https://elcvia.cvc.uab.cat/article/view/v16-n2-fidalgo

4 p, 515.1 KB

The record appears in these collections:
Articles > Published articles > ELCVIA
Articles > Research articles

Record created 2018-04-09, last modified 2026-04-05
