Web of Science: 1 citation, Scopus: 2 citations, Google Scholar: citations
The space of models in machine learning: using Markov chains to model transitions
Torra, Vicenç (Umeå University. Department of Computing Science)
Taha, Mariam (Umeå University. Department of Computing Science)
Navarro-Arribas, Guillermo (Universitat Autònoma de Barcelona. Departament d'Enginyeria de la Informació i de les Comunicacions)

Date: 2021
Abstract: Machine and statistical learning is about constructing models from data. Data is usually understood as a set of records, a database. Nevertheless, databases are not static but change over time. We can understand this as follows: there is a space of possible databases, and over its lifetime a database moves through this space. Therefore, we may consider transitions between databases, and the database space. NoSQL databases also fit this representation. In addition, when we learn models from databases, we can also consider the space of models. Naturally, there are relationships between the space of data and the space of models. Any transition in the space of data may correspond to a transition in the space of models. We argue that a better understanding of the space of data and the space of models, as well as of the relationships between these two spaces, is fundamental to machine and statistical learning. The relationship between these two spaces can be exploited in several contexts, e.g., in model selection and data privacy. We consider that this relationship between spaces is also fundamental to understanding generalization and overfitting. In this paper, we develop these ideas. Then, we consider a distance on the space of models based on a distance on the space of data. More particularly, we consider distance distribution functions and probabilistic metric spaces on the space of data and the space of models. Our modeling of changes in databases is based on Markov chains and transition matrices. This modeling is used in the definition of distances. We provide examples of our definitions.
Grants: Agencia Estatal de Investigación TIN2017-87211-R
Rights: This document is subject to a Creative Commons license. Full or partial reproduction, distribution, public communication of the work, and the creation of derivative works are permitted, including for commercial purposes, provided that the authorship of the original work is acknowledged.
Language: English
Document: Article; research; Published version
Subject: Hypothesis space; Machine and statistical learning models; Probabilistic metric spaces; Space of data; Space of models
Published in: Progress in Artificial Intelligence, Vol. 10, Issue 3 (September 2021), p. 321-332, ISSN 2192-6360

DOI: 10.1007/s13748-021-00242-6


12 p, 537.4 KB
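
Illustration: the abstract describes modeling changes in a database as a Markov chain over a space of possible databases. The minimal Python sketch below shows what such a chain could look like for a toy case. The two-record universe, the rule that every single-record insertion or deletion is equally likely, and all function names are assumptions made for illustration only; they are not taken from the paper, and the sketch does not reproduce the paper's distance definitions.

```python
from itertools import combinations

RECORDS = ("a", "b")  # hypothetical universe of records


def all_databases(records):
    """Enumerate every subset of the record universe (the toy 'space of data')."""
    return [frozenset(c)
            for r in range(len(records) + 1)
            for c in combinations(records, r)]


def transition_matrix(states, records):
    """Row-stochastic matrix; assumed rule: toggle one record, chosen uniformly."""
    index = {s: i for i, s in enumerate(states)}
    P = [[0.0] * len(states) for _ in states]
    for s in states:
        for r in records:
            t = s ^ {r}  # insert r if absent, delete it if present
            P[index[s]][index[t]] += 1.0 / len(records)
    return P


def mat_mul(A, B):
    """Plain matrix product of two lists of lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))]
            for i in range(len(A))]


def n_step(P, n):
    """n-step transition probabilities, i.e. the matrix power P^n."""
    size = len(P)
    result = [[1.0 if i == j else 0.0 for j in range(size)] for i in range(size)]
    for _ in range(n):
        result = mat_mul(result, P)
    return result


states = all_databases(RECORDS)
P = transition_matrix(states, RECORDS)
P2 = n_step(P, 2)
# Probability of each database two changes after starting from the empty one.
for state, prob in zip(states, P2[0]):
    print(sorted(state), prob)
```

Under these assumptions, a row of the n-step matrix gives the probability of each database after n changes from a given starting database; quantities of this kind are what a distance between databases could be built on.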

This record appears in the collections:
Research documents > Documents of UAB research groups > Research centres and groups (scientific output) > Engineering > Combinatorics, Coding and Security Group (CCSG)
Articles > Research articles
Articles > Published articles

Record created on 2023-07-18, last modified on 2023-07-29


