New insights into evaluation of regression models through a decomposition of the prediction errors : application to near-infrared spectral data
Sánchez Rodríguez, María Isabel (Universidad de Córdoba. Departamento de Estadística, Econometría, Investigación Operativa, Organización de Empresas y Economía Aplicada)
Sánchez López, Elena (Universidad de Córdoba. Departamento de Química Orgánica)
Caridad, José Mª (Universidad de Córdoba. Departamento de Estadística, Econometría, Investigación Operativa, Organización de Empresas y Economía Aplicada)
Marinas, Alberto (Universidad de Córdoba. Departamento de Química Orgánica)
Marinas, Jose Mª (Universidad de Córdoba. Departamento de Química Orgánica)
Urbano, Francisco José (Universidad de Córdoba. Departamento de Química Orgánica)

Date: 2013
Abstract: This paper analyzes the performance of linear regression models using the usual criteria: the number of principal components or latent factors, the goodness of fit, and the predictive capability. Other comparison criteria, more common in an economic context, are also considered: the degree of multicollinearity and a decomposition of the mean squared error of prediction, which determines whether the prediction errors are systematic or random in nature. The applications use real data on extra-virgin oil obtained by near-infrared spectroscopy. The high dimensionality of the data is reduced by applying principal component analysis and partial least squares analysis. A possible improvement of these methods, using cluster analysis or the information in the relative maxima of the spectrum, is investigated. Finally, the results obtained are generalized via cross-validation and bootstrapping.
Rights: This document is subject to a Creative Commons license. Total or partial reproduction and public communication of the work are permitted, provided that they are not for commercial purposes and that the authorship of the original work is acknowledged. The creation of derivative works is not permitted.
Language: English
Document: Article ; research ; Published version
Subject: Principal components ; Partial least squares ; Multivariate calibration ; Near-infrared spectroscopy
Published in: SORT : statistics and operations research transactions, Vol. 37, No. 1 (January-June 2013), p. 57-78, ISSN 2013-8830

Alternative address: https://raco.cat/index.php/SORT/article/view/261671


22 p, 3.5 MB


 Record created 2013-11-11, last modified 2023-12-22


