Mostrar o rexistro simple do ítem

dc.contributor.authorArroyuelo, Diego
dc.contributor.authorBustos, Benjamin
dc.contributor.authorGómez-Brandón, Adrián
dc.contributor.authorHogan, Aidan
dc.contributor.authorNavarro, Gonzalo
dc.contributor.authorReutter, Juan
dc.date.accessioned2024-07-10T12:40:54Z
dc.date.available2024-07-10T12:40:54Z
dc.date.issued2024-03-06
dc.identifier.citationDiego Arroyuelo, Benjamin Bustos, Adrián Gómez-Brandón, Aidan Hogan, Gonzalo Navarro, and Juan Reutter. 2024. Worst-Case-Optimal Similarity Joins on Graph Databases. Proc. ACM Manag. Data 2, 1, Article 39 (February 2024), 26 pages. https://doi.org/10.1145/3639294es_ES
dc.identifier.issn2836-6573
dc.identifier.urihttp://hdl.handle.net/2183/37882
dc.description© ACM 2024. This is the author's version of the work (accepted manuscript or postprint). It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in Proceedings of the ACM on Management of Data, https:// doi.org/10.1145/3639294es_ES
dc.description.abstract[Absctract]: We extend the concept of worst-case optimal equijoins in graph databases to the case where some nodes are required to be within the k-nearest neighbors (kNN) of others under some similarity function. We model the problem by superimposing the database graph with the kNN graph and show that a variant of Leapfrog TrieJoin (LTJ) implemented over a compact data structure called the Ring can be seamlessly extended to integrate similarity clauses with the equijoins in the LTJ query process, retaining worst-case optimality in many relevant cases. Our experiments on a benchmark that combines Wikidata and IMGpedia show that our enhanced LTJ algorithm outperforms by a considerable margin a baseline that first applies classic LTJ and then completes the query by applying the similarity predicates. The difference is more pronounced on queries where the similarity clauses are more densely connected to the query, becoming of an order of magnitude in some cases.es_ES
dc.description.sponsorshipFunded by ANID – Millennium Science Initiative Program – Code ICN17_002. A.G. is funded in part by MCIN/AEI/10.13039/5011000-11033: grant PID2020-114635RB-I00 (EXTRACompact); by MCIN/A-EI/10.13039/501100011033 and EU/ERDF "A way of making Europe": PID2022-141027NBC21 (EARTHDL); by MCIN/AEI/10.13039/501100011033 and “Next-GenerationEU”/ PRTR: grants TED2021-129245B-C21 (PLAGEMIS), PDC2021-120917-C21 (SIGTRANS) and by GAIN/Xunta de Galicia: GRC: grants ED431C 2021/53, and CIGUS 2023-2026. A.H. was funded in part by Fondecyt Grant 1221926. G.N. was funded in part by Fondecyt Grant 1-230755. J.R. was funded in part by Fondecyt Grant 1221799.es_ES
dc.description.sponsorshipChile. Agencia Nacional de Investigación y Desarrollo; ICN17_002es_ES
dc.description.sponsorshipXunta de Galicia; ED431C 2021/53es_ES
dc.description.sponsorshipXunta de Galicia; CIGUS 2023-2026es_ES
dc.description.sponsorshipChile. Fondo Nacional de Desarrollo Científico y Tecnológico (Fondecyt); 1-230755es_ES
dc.description.sponsorshipChile. Fondo Nacional de Desarrollo Científico y Tecnológico (Fondecyt); 1221926es_ES
dc.description.sponsorshipChile. Fondo Nacional de Desarrollo Científico y Tecnológico (Fondecyt); 1221799es_ES
dc.language.isoenges_ES
dc.publisherAssociation for Computing Machinery (ACM)es_ES
dc.relationinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-141027NB-C21/ES/MODELADO, DESCUBRIMIENTO, EXPLORACION Y ANALISIS DE DATA LAKES MEDIOAMBIENTALESes_ES
dc.relationinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-129245B-C21/ES/PLAGEMISes_ES
dc.relationinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PDC2021-120917-C21/ES/SIGTRANSes_ES
dc.relation.urihttps://doi.org/10.1145/3639294es_ES
dc.subjectWorst-case optimal joinses_ES
dc.subjectLeapfrog Triejoines_ES
dc.subjectGraph patternses_ES
dc.subjectGraph databaseses_ES
dc.subjectGraph indexinges_ES
dc.subjectSimilarity joinses_ES
dc.subjectNearest-neighbor graphses_ES
dc.titleWorst-Case-Optimal Similarity Joins on Graph Databaseses_ES
dc.typeinfo:eu-repo/semantics/conferenceObjectes_ES
dc.typeinfo:eu-repo/semantics/conferenceObjectes_ES
dc.rights.accessinfo:eu-repo/semantics/openAccesses_ES
UDC.journalTitleProceedings of the ACM on Management of Dataes_ES
UDC.volume2es_ES
UDC.issue1es_ES
UDC.startPageNº article: 39es_ES
UDC.conferenceTitleACM SIGMOD/PODS International Conference on Management of Dataes_ES


Ficheiros no ítem

Thumbnail

Este ítem aparece na(s) seguinte(s) colección(s)

Mostrar o rexistro simple do ítem