Google Scholar: citas
EarthView : a large scale remote sensing dataset for self-supervision
Velazquez Dorta, Diego Alejandro (Centre de Visió per Computador)
Rodríguez López, Pau (Centre de Visió per Computador)
Alonso Muñoz, Sergio (Satellogic)
Gonfaus Sitjes, Josep Maria (Satellogic)
Gonzàlez, Jordi (Centre de Visió per Computador)
Richarte, Gerardo (Satellogic)
Marin, Javier (Satellogic)
Bengio, Yoshua (Université de Montréal)
Lacoste, Alexandre (ServiceNow Research)

Fecha: 2025
Resumen: This paper presents EarthView, a comprehensive dataset specifically designed for self-supervision on remote sensing data, intended to enhance deep learning applications on Earth monitoring tasks. The dataset spans 15 tera pixels of global remote-sensing data, combining imagery from a diverse range of sources, including NEON, Sentinel, and a novel release of 1m spatial resolution data from Satellogic. Our dataset provides a wide spectrum of image data with varying resolutions, harnessed from different sensors and organized coherently into an accessible HuggingFace dataset in parquet format. This data spans five years, from 2017 to 2022. Accompanying the dataset, we introduce EarthMAE, a tailored Masked Autoencoder, developed to tackle the distinct challenges of remote sensing data. Trained in a self-supervised fashion, EarthMAE effectively processes different data modalities such as hyperspectral, multispectral, topographical data, segmentation maps, and temporal structure. This model helps us show that pre-training on Satellogic data improves performance on downstream tasks. While there is still a gap to fill in MAE for heterogeneous data, we regard this innovative combination of an expansive, diverse dataset and a versatile model adapted for self-supervised learning as a stride forward in deep learning for Earth monitoring.
Derechos: Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial, la distribució, la comunicació pública de l'obra i la creació d'obres derivades, fins i tot amb finalitats comercials, sempre i quan es reconegui l'autoria de l'obra original. Creative Commons
Lengua: Anglès
Documento: Prepublicació ; recerca ; Versió de l'autor
Materia: Cs.cv

DOI: 10.48550/arXiv.2501.08111


13 p, 8.4 MB

El registro aparece en las colecciones:
Documentos de investigación > Prepublicacions

 Registro creado el 2025-04-27, última modificación el 2026-03-09



   Favorit i Compartir