Improved Compressed String Dictionaries

Use este enlace para citar
http://hdl.handle.net/2183/24435Coleccións
- Investigación (FIC) [1685]
Metadatos
Mostrar o rexistro completo do ítemTítulo
Improved Compressed String DictionariesTítulo(s) alternativo(s)
CIKM '19 Proceedings of the 28th ACM International Conference on Information and Knowledge ManagementData
2019-11-03Cita bibliográfica
Brisaboa, Nieves R., et al. Improved Compressed String Dictionaries. En Proceedings of the 28th ACM International Conference on Information and Knowledge Management. ACM, 2019. p. 29-38. Doi: 10.1145/3357384.3357972
Resumo
[Abstract] We introduce a new family of compressed data structures to efficiently store and query large string dictionaries in main memory. Our main technique is a combination of hierarchical Front-coding with ideas from longest-common-prefix computation in suffix arrays. Our data structures yield relevant space-time tradeoffs in real-world dictionaries. We focus on two domains where string dictionaries are extensively used and efficient compression is required: URL collections, a key element in Web graphs and applications such as Web mining; and collections of URIs and literals, the basic components of RDF datasets. Our experiments show that our data structures achieve better compression than the state-of-the-art alternatives while providing very competitive query times.
Palabras chave
Compression
Data structures
String dictionaries
Data structures
String dictionaries