Estimating the compressibility of raster data

UDC.coleccionInvestigación
UDC.departamentoCiencias da Computación e Tecnoloxías da Información
UDC.grupoInvLaboratorio de Bases de Datos (LBD)
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación
UDC.journalTitleInformation Systems
UDC.startPage102624
UDC.volume2026
dc.contributor.authorMuñoz, Martita
dc.contributor.authorFuentes Sepúlveda, José
dc.contributor.authorHernández, Cecilia
dc.contributor.authorSeco, Diego
dc.date.accessioned2025-11-12T09:13:50Z
dc.date.available2025-11-12T09:13:50Z
dc.date.issued2025-09-23
dc.description©2025 Elsevier B.V. All rights reserved. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/bync-nd/4.0/. This version of the article has been accepted for publication in Information Systems. The Version of Record is available online at https://doi.org/10.1016/j.is.2025.102624
dc.description.abstract[Abstract]: The raster data model is widely used in Geographic Information Systems and image processing. The continuous growth of raster data volume poses significant challenges for storage and management. Compact representations of rasters have emerged as a critical solution to address this issue, leveraging data locality to achieve efficient compression. In this context, the research community has proposed compressibility measures aiming to estimate the compressibility of data. Some measures, initially proposed for sequences, have been extended to two- and three-dimensional matrices. This work conducts an experimental analysis of measures applied to raster data compressibility estimation. The first approach applies a linearization function on raster data with matrix representation and then uses existing one-dimensional compressibility measures. The evaluation of the approach compares 1D compressibility measures with 2D measures, data compressors, Compact Data Structures (CDSs), and spatial locality estimation techniques. The results show that spatial locality, alphabet size, and noise directly influence raster compressibility, having more impact over measures like z, v, and g, compressors (bzip, gzip) and a CDS called k2-raster. The second approach introduces δ∆, a 2D compressibility measure sensitive to differences within the alphabet values. Its purpose is to refine the estimation of raster compressibility. Results indicate that δ∆ is affected by the actual values and their frequencies, aligning with the outcomes of some specific compressors. This alignment underscores the suitability of δ∆ for compressibility estimation tasks closely related to those performed by such compressors.
dc.description.sponsorshipThis work was supported by the Agencia Nacional de Investigación y Desarrollo (ANID), Chile [grant numbers 21200810, 11220545, 11240971]; the Centre for Biotechnology and Bioengineering [grant number AFB240001]; the ANID – Millennium Science Initiative Program [grant number ICN17_002]; and PID2022-141027NB-C21 (EarthDL), TED2021-129245B-C21 (PLAGEMIS). Partially funded by MCIN/AEI/10.13039/501100011033 and “NextGenerationEU”/PRTR.
dc.description.sponsorshipChile. Agencia National de Investigación y Desarrollo; 21200810
dc.description.sponsorshipChile. Agencia National de Investigación y Desarrollo; 11220545
dc.description.sponsorshipChile. Agencia National de Investigación y Desarrollo; 11240971
dc.description.sponsorshipCentro de Biotecnología y Bioingeniería (Chile); AFB240001
dc.description.sponsorshipChile. Agencia National de Investigación y Desarrollo; ICN17_002
dc.identifier.citationM. Muñoz, J. Fuentes-Sepúlveda, C. Hernández, and D. Seco , "Estimating the compressibility of raster data", Information Systems, Vol. 136, Feb. 2026, 102624, https://doi.org/10.1016/j.is.2025.102624
dc.identifier.doi10.1016/j.is.2025.102624
dc.identifier.issn1873-6076
dc.identifier.urihttps://hdl.handle.net/2183/46416
dc.language.isoeng
dc.publisherElsevier
dc.relation.projectIDinfo:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-141027NB-C21/ES/MODELADO, DESCUBRIMIENTO, EXPLORACION Y ANALISIS DE DATA LAKES MEDIOAMBIENTALES [UDC]
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-129245B-C21/ES/PLATAFORMA PARA LA GENERACIÓN AUTOMÁTICA DE SISTEMAS DE INFORMACIÓN DE LA MOVILIDAD ENERGÉTICAMENTE EFICIENTES, BASADOS EN ESTRUCTURAS DE DATOS COMPACTAS Y GIS (PLAGEMIS)
dc.relation.urihttps://doi.org/10.1016/j.is.2025.102624
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.accessRightsembargoed access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectRaster Data Model
dc.subjectRaster Compression
dc.subjectSpace-Filling Curve
dc.subjectCompressibility measures
dc.titleEstimating the compressibility of raster data
dc.typejournal article
dc.type.hasVersionAM
dspace.entity.typePublication
relation.isAuthorOfPublication205d0115-1d0f-46c4-8581-ea7a69642870
relation.isAuthorOfPublication.latestForDiscovery205d0115-1d0f-46c4-8581-ea7a69642870

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Seco_Diego_2026_Estimating_the_compressibility_of_raster_data.pdf
Size:
1.15 MB
Format:
Adobe Portable Document Format