An Experimental Evaluation of k2-Tree on External Memory

UDC.coleccionInvestigación
UDC.departamentoCiencias da Computación e Tecnoloxías da Información
UDC.grupoInvLaboratorio de Bases de Datos (LBD)
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación
UDC.journalTitleSoftware: Practice and Experience
UDC.volume2025
dc.contributor.authorGutiérrez Retamal, Gilberto
dc.contributor.authorRomero, Miguel
dc.contributor.authorRodríguez Penabad, Miguel
dc.contributor.authorSantolaya, Fernando
dc.contributor.authorCaniupán, Mónica
dc.contributor.authorTorres, Rodrigo
dc.date.accessioned2025-11-11T08:18:25Z
dc.date.available2025-11-11T08:18:25Z
dc.date.issued2025-10-24
dc.descriptionThis is the peer reviewed version of the following article: G. Gutiérrez, M. Romero, M. R. Penabad, F. Santolaya, M. Caniupán, and Rodrigo Torres, "An Experimental Evaluation of k2-Tree on External Memory", Software: Practice and Experience, 2025, which has been published in final form at https://doi.org/10.1002/spe.70028. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Use of Self-Archived Versions. This article may not be enhanced, enriched or otherwise transformed into a derivative work, without express permission from Wiley or by statutory rights under applicable legislation. Copyright notices must not be removed, obscured or modified. The article must be linked to Wiley’s version of record on Wiley Online Library and any embedding, framing or otherwise making available the article or pages thereof by third parties from platforms, services and websites other than Wiley Online Library must be prohibited.
dc.description.abstract[Abstract]: The k2-tree compact data structure has gained great popularity to represent binary relationships in main memory. It presents a good performance and a good trade-off between storage and execution time. However, datasets being too large, or limited resources, may prevent the dataset from fitting into RAM even in compressed form. This work presents an experimental evaluation of a k2-tree in external memory (disk), in terms data access (I/O operation or cache misses) and execution time for 4 types of common queries. We compare the k2-tree with other data structures, namely a Quadtree (specifically, a Linear QuadTree, LQT) and the classical adjacency matrix, all of them being in external memory. We used for the test both synthetical as well as large, real world, datasets. Several aspects, such as the size of the memory cache and its replacement scheme, or specific parameters like the arity (k value) of the k2-tree were considered in the experiments. In terms of storage needs, the k2-tree clearly outperforms the other alternatives, in extreme cases needing only 1% of the LQT space, or 0.01% of the adjacency matrix (and for some large datasets neither the LQT nor the adjacency matrix could be built). In terms of performance, except for cases with small datasets, where the adjacency matrix is the best option for some queries, the k2-tree is either competitive or outperforms the alternatives.
dc.description.sponsorshipThis work was supported by ANID Grant 1230647, ALBA group code 2130591 GI/VC, Project INES I + D 22-14, and Projects 2130520IF/R and 2230534IF/R (funded by DICREA/UBB). Miguel R. Penabad's work was partially funded by MCIN/AEI/10.13039/501100011033 and NextGenerationEU/PRTR projects (Grants TED2021-129245B-C21 (PLAGEMIS), PDC2021-121239-C31 (FLATCITY-POC), and PID2020-114635RB-I00 (EXTRACompact)); by GAIN/Xunta de Galicia project GRC: ED431C 2021/53; CITIC is funded by the Xunta de Galicia.
dc.description.sponsorshipChile. Fondo Nacional de Desarrollo Científico y Tecnológico; 1230647
dc.description.sponsorshipUniversidad del Bío-Bío; 2130591 GI/VC
dc.description.sponsorshipChile. Agencia Nacional de Investigación y Desarrollo (ANID) – Programa InES I+D; 22-14
dc.description.sponsorshipUniversidad del Bío-Bío; 2230534 IF/R
dc.description.sponsorshipUniversidad del Bío-Bío; 2130520 IF/R
dc.description.sponsorshipXunta de Galicia; ED431C 2021/53
dc.identifier.citationG. Gutiérrez, M. Romero, M. R. Penabad, F. Santolaya, M. Caniupán, and Rodrigo Torres, "An Experimental Evaluation of k2-Tree on External Memory", Software: Practice and Experience, 2025, https://doi.org/10.1002/spe.70028
dc.identifier.doi10.1002/spe.70028
dc.identifier.issn1097-024X
dc.identifier.urihttps://hdl.handle.net/2183/46385
dc.language.isoeng
dc.publisherWiley
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-129245B-C21/ES/PLATAFORMA PARA LA GENERACIÓN AUTOMÁTICA DE SISTEMAS DE INFORMACIÓN DE LA MOVILIDAD ENERGÉTICAMENTE EFICIENTES, BASADOS EN ESTRUCTURAS DE DATOS COMPACTAS Y GIS (PLAGEMIS)
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PDC2021-121239-C31/ES/FLATCITY-BOARD: BACKEND AND DASHBOARD FOR FLATCITY
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-114635RB-I00/ES/EXPLOTACION ENRIQUECIDA DE TRAYECTORIAS CON ESTRUCTURAS DE DATOS COMPACTAS Y GIS/
dc.relation.urihttps://doi.org/10.1002/spe.70028
dc.rights© 2025 John Wiley & Sons Ltd.
dc.rights.accessRightsembargoed access
dc.subjectExternal memory
dc.subjectCompact data structures
dc.subjectk2-tree
dc.titleAn Experimental Evaluation of k2-Tree on External Memory
dc.typejournal article
dc.type.hasVersionAM
dspace.entity.typePublication
relation.isAuthorOfPublicationa30cb84d-2ead-45c1-822a-e1b0741abd78
relation.isAuthorOfPublication.latestForDiscoverya30cb84d-2ead-45c1-822a-e1b0741abd78

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Penabad_MiguelR_2025_An_Experimental_Evaluation_of_k2_Tree_on_External_Memory.pdf
Size:
1.26 MB
Format:
Adobe Portable Document Format