Classic distance join queries using compact data structures

UDC.coleccionInvestigaciónes_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.endPage21es_ES
UDC.grupoInvLaboratorio de Bases de Datos (LBD)es_ES
UDC.journalTitleInformation Scienceses_ES
UDC.startPage1es_ES
UDC.volume674es_ES
dc.contributor.authorBernardo, Guillermo de
dc.contributor.authorPenabad, Miguel R.
dc.contributor.authorCorral, Antonio
dc.contributor.authorBrisaboa, Nieves R.
dc.date.accessioned2024-05-27T17:58:23Z
dc.date.available2024-05-27T17:58:23Z
dc.date.issued2024-07
dc.description.abstract[Abstract]: Distance-based Join Queries (DJQs) have multiple applications in spatial databases, Geographic Information Systems, and other areas. The K Closest Pairs Query (KCPQ) and the 𝜀� Distance Join Query (𝜀� DJQ) are well-known DJQs that have been widely studied and can be solved using plane-sweep techniques, which are efficient but must keep the whole datasets in main memory. In this work, we propose DJQ algorithms that work with data represented using a k2-tree, a compact data structure for binary grids. Our algorithms solve KCPQ and 𝜀� DJQ queries, as well as several window-constrained variants, taking advantage of the indexing capabilities of K2 -trees to efficiently answer queries without the need to decompress the data. Our experimental evaluation with large datasets shows that k2-tree algorithms are up to 5 times faster than plane-sweep algorithms in KCPQ, and 5–30 times faster in 𝜀� DJQ. In variants that are window-constrained, our algorithms are competitive in most scenarios and faster for large windows. Additionally, our algorithms are not very affected by the distribution of the data and yield much more predictable query times, showing up to 30 times smaller variance in query times than plane sweep, depending on the location of the query window.es_ES
dc.description.sponsorshipGuillermo de Bernardo, Miguel R. Penabad and Nieves R. Brisaboa are partially funded by MCIN/AEI/10.13039/501100011033 and EU/ERDF A way of making Europe [PID2021-122554OB-C33 (OASSIS)]; by MCIN/AEI/10.13039/501100011033 [PID2020-114635RB-I00 (EXTRACompact), PID2019-105221RB-C41 (MAGIST)]; by MCIN/AEI/10.13039/501100011033 and “NextGenerationEU”/PRTR [TED2021-129245B-C21 (PLAGEMIS), PDC2021-121239-C31 (FLATCITY-POC), PDC2021-120917-C21 (SIGTRANS)]; and by GAIN/Xunta de Galicia [ED431C 2021/53]. CITIC is funded by the Xunta de Galicia through the collaboration agreement between the Department of Culture, Education, Vocational Training and Universities and the Galician universities for the reinforcement of the research centers of the Galician University System (CIGUS) [ED431G 2019/01 (CSI)]. The work by Antonio Corral was partially funded by the EU ERDF and the Andalusian Government (Spain) under the project UrbanITA (ref. PY20_00809) and grant PID2021-124124OB-I00 funded by the Spanish Government MCIN/AEI/10.13039/501100011033 and the “ERDF A way of making Europe”. Funding for open access charge: Universidade da Coruña/CISUG.es_ES
dc.description.sponsorshipXunta de Galicia; ED431C 2021/53es_ES
dc.description.sponsorshipXunta de Galicia; ED431G 2019/01es_ES
dc.description.sponsorshipFinanciado para publicación en acceso aberto: Universidade da Coruña/CISUGes_ES
dc.identifier.citationde Bernardo, G., Penabad, M. R., Corral, A., & Brisaboa, N. R. (2024). Classic distance join queries using compact data structures. Information Sciences, 674, 120732. https://doi.org/10.1016/j.ins.2024.120732es_ES
dc.identifier.doi10.1016/j.ins.2024.120732
dc.identifier.issn0020-0255
dc.identifier.urihttp://hdl.handle.net/2183/36651
dc.language.isoenges_ES
dc.publisherElsevieres_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-122554OB-C33/ES/OASSIS-UDC: HACIA ORGANIZACIONES SOFTWARE MÁS SOSTENIBLES: UN ENFOQUE HOLÍSTICO PARA PROMOVER LA SOSTENIBILIDAD ECONÓMICA, HUMANA Y MEDIOAMBIENTALes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-114635RB-I00/ES/EXPLOTACION ENRIQUECIDA DE TRAYECTORIAS CON ESTRUCTURAS DE DATOS COMPACTAS Y GIS/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-105221RB-C41/ES/VISUALIZACION Y EXPLORACION BASADA EN FLUJOS Y ANALITICA DE BIG DATA ESPACIAL/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-124124OB-I00/ES/UN MARCO FORMAL BASADO EN LA WEB DE LAS COSAS Y COMPUTACION EDGE PARA LA DEFINICION, EL DESCUBRIMIENTO Y EL PROCESAMIENTO DE DATOS DE COMPONENTES CIBERFISICOSes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-129245B-C21/ES/PLATAFORMA PARA LA GENERACIÓN AUTOMÁTICA DE SISTEMAS DE INFORMACIÓN DE LA MOVILIDAD ENERGÉTICAMENTE EFICIENTES, BASADOS EN ESTRUCTURAS DE DATOS COMPACTAS Y GIS (PLAGEMIS)es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PDC2021-121239-C31/ES/FLATCITY-BOARD: BACKEND AND DASHBOARD FOR FLATCITYes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PDC2021-120917-C21/ES/SIGTRANS-UDCes_ES
dc.relation.urihttps://doi.org/10.1016/j.ins.2024.120732es_ES
dc.rightsAtribución-NoComercial 4.0 Internacionales_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by-nc/3.0/es/*
dc.subjectK closest pairses_ES
dc.subjectε distance joines_ES
dc.subjectSpatial query evaluationes_ES
dc.subjectk2-treees_ES
dc.titleClassic distance join queries using compact data structureses_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublication23354397-ec74-4cbb-93ac-f85352e9fbd8
relation.isAuthorOfPublication42f2c226-9868-4516-8efd-2cd3c6692034
relation.isAuthorOfPublication.latestForDiscovery23354397-ec74-4cbb-93ac-f85352e9fbd8

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Bernardo_Guillermo_de_2024_Classic_distance_join_queries_using_compact_data_structures.pdf
Size:
1.66 MB
Format:
Adobe Portable Document Format
Description: