Zombit: Exploiting Runs in Bitvectors
| UDC.coleccion | Investigación | |
| UDC.departamento | Ciencias da Computación e Tecnoloxías da Información | |
| UDC.endPage | 18 | |
| UDC.grupoInv | Laboratorio de Bases de Datos (LBD) | |
| UDC.institutoCentro | CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación | |
| UDC.journalTitle | Software: Practice and Experience | |
| UDC.startPage | 1 | |
| UDC.volume | 2025 | |
| dc.contributor.author | Gómez-Brandón, Adrián | |
| dc.date.accessioned | 2025-09-24T11:16:36Z | |
| dc.date.available | 2025-09-24T11:16:36Z | |
| dc.date.issued | 2025-09-06 | |
| dc.description | The data that support the findings of this study are openly available in zombit-vector at https://github.com/adriangbrandon/zombit-vector. Financiado para publicación en acceso aberto: Universidade da Coruña/CISUG | |
| dc.description.abstract | [Abstract]: Introduction: Bitvectors are a fundamental building block of many compact data structures. In this work, we propose a new compressed representation for bitvectors, named zombit-vector, which compresses bitvectors with 𝑘 runs by splitting the bitvector into fixed-length blocks and classifying them into three possible types depending on their content. Methods: It supports the typical operations over bitvectors in 𝑂(1) time, but select in 𝑂(log 𝑛) time. The total space required for this structure is 𝑂(√𝑘𝑛) + 𝑜(√𝑘𝑛) bits. In addition, we introduce an extension pzombit-vector where the blocks have variablelength. The variable-length partitioning adapts better to the distribution of the data, reducing space needs but requiring 𝑂(log 𝑛) time for all operations. Results: We include experiments on synthetic data and over two real-world scenarios. The synthetic scenario shows the competitiveness of our techniques against well-known state-of-the-art alternatives. That experiment confirms that pzombit-vector is able to use one-tenth of zombit-vector, but becomes 5 times slower. Our experiments with real applications demonstrate that the proposed techniques are practical and applicable in real-world scenarios. | |
| dc.description.sponsorship | Funding: Ministerio de Ciencia e Innovación, Grant/Award Nos: PID2020-114635RB-I00; TED2021-129245B-C21; PID2022-141027NB-C21; Xunta de Galicia, Grant/Award No: ED431C 2025/34; and CITIC receives subsidies from Xunta de Galicia and FEDER Galicia (Ref. ED431G 2023/01). Funding for open access charge: Universidade da Coruña/CISUG. | |
| dc.description.sponsorship | Xunta de Galicia; ED431C 2025/34 | |
| dc.description.sponsorship | Xunta de Galicia; ED431G 2023/01 | |
| dc.identifier.citation | A. Gómez-Brandón, "Zombit: Exploiting Runs in Bitvectors", Software: Practice and Experience, 2025, pp. 1–18, https://doi.org/10.1002/spe.70019 | |
| dc.identifier.doi | 10.1002/spe.70019 | |
| dc.identifier.issn | 1097-024X | |
| dc.identifier.issn | 0038-0644 | |
| dc.identifier.uri | https://hdl.handle.net/2183/45808 | |
| dc.language.iso | eng | |
| dc.publisher | Wiley | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-114635RB-I00/ES/EXPLOTACION ENRIQUECIDA DE TRAYECTORIAS CON ESTRUCTURAS DE DATOS COMPACTAS Y GIS/ | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-129245B-C21/ES/PLATAFORMA PARA LA GENERACIÓN AUTOMÁTICA DE SISTEMAS DE INFORMACIÓN DE LA MOVILIDAD ENERGÉTICAMENTE EFICIENTES, BASADOS EN ESTRUCTURAS DE DATOS COMPACTAS Y GIS (PLAGEMIS) | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica, Técnica y de Innovación 2021-2023/PID2022-141027NB-C21/ES/MODELADO, DESCUBRIMIENTO, EXPLORACION Y ANALISIS DE DATA LAKES MEDIOAMBIENTALES | |
| dc.relation.uri | https://doi.org/10.1002/spe.70019 | |
| dc.rights | Attribution 4.0 International | en |
| dc.rights.accessRights | open access | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | bitvectors | |
| dc.subject | compact data structures | |
| dc.subject | bitvectors; compact data structures; compression; indices | |
| dc.subject | indices | |
| dc.title | Zombit: Exploiting Runs in Bitvectors | |
| dc.type | journal article | |
| dc.type.hasVersion | VoR | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 1a99c615-806a-48b0-8e5f-7772467f275d | |
| relation.isAuthorOfPublication.latestForDiscovery | 1a99c615-806a-48b0-8e5f-7772467f275d |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- GomezBrandon_Adrian_2025_Zombit.pdf
- Size:
- 1.09 MB
- Format:
- Adobe Portable Document Format

