Classic distance join queries using compact data structures
View/ Open
Use this link to cite
http://hdl.handle.net/2183/36651
Except where otherwise noted, this item's license is described as Atribución-NoComercial 4.0 Internacional
Collections
- Investigación (FIC) [1578]
Metadata
Show full item recordTitle
Classic distance join queries using compact data structuresDate
2024-07Citation
de Bernardo, G., Penabad, M. R., Corral, A., & Brisaboa, N. R. (2024). Classic distance join queries using compact data structures. Information Sciences, 674, 120732. https://doi.org/10.1016/j.ins.2024.120732
Abstract
[Abstract]: Distance-based Join Queries (DJQs) have multiple applications in spatial databases, Geographic Information Systems, and other areas. The K Closest Pairs Query (KCPQ) and the 𝜀� Distance Join Query (𝜀� DJQ) are well-known DJQs that have been widely studied and can be solved using plane-sweep techniques, which are efficient but must keep the whole datasets in main memory. In this work, we propose DJQ algorithms that work with data represented using a k2-tree, a compact data structure for binary grids. Our algorithms solve KCPQ and 𝜀� DJQ queries, as well as several window-constrained variants, taking advantage of the indexing capabilities of K2 -trees to efficiently answer queries without the need to decompress the data. Our experimental evaluation with large datasets shows that k2-tree algorithms are up to 5 times faster than plane-sweep algorithms in KCPQ, and 5–30 times faster in 𝜀� DJQ. In variants that are window-constrained, our algorithms are competitive in most scenarios and faster for large windows. Additionally, our algorithms are not very affected by the distribution of the data and yield much more predictable query times, showing up to 30 times smaller variance in query times than plane sweep, depending on the location of the query window.
Keywords
K closest pairs
ε distance join
Spatial query evaluation
k2-tree
ε distance join
Spatial query evaluation
k2-tree
Editor version
Rights
Atribución-NoComercial 4.0 Internacional
ISSN
0020-0255