Onto how compression yields energy-efficient text search

UDC.coleccionInvestigaciónes_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.endPage37es_ES
UDC.grupoInvLaboratorio de Bases de Datos (LBD)es_ES
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicaciónes_ES
UDC.issuearticle 122es_ES
UDC.journalTitleComputinges_ES
UDC.startPage1es_ES
UDC.volume107es_ES
dc.contributor.authorFariña, Antonio
dc.contributor.authorPoy, Olivia
dc.contributor.authorBrisaboa, Nieves R.
dc.contributor.authorCalero, Coral
dc.contributor.authorMoraga, Mª Ángeles
dc.contributor.authorPedreira, Óscar
dc.date.accessioned2025-05-13T15:57:36Z
dc.date.available2025-05-13T15:57:36Z
dc.date.issued2025-04
dc.description.abstract[Abstract]: In the last two decades, word-based text compression has shown to be the key to efficiently handling large collections of text not only due to yielding important storage savings but, more importantly, because it allowed boosting the performance of some traditional text retrieval systems. The reason is that when the appropriate compression techniques are chosen, compressed text search becomes much faster than searches on plain text, and retrieval/decompression could start at any part of the compressed data, hence allowing to keep the text collection compressed all the time. Word-based text compressors have been typically compared in terms of their compression effectiveness, encoding/decoding speed, and performance when searching for words. In this paper, we show that compression also has benefits in terms of energy efficiency when performing word-based searches. Particularly, our experiments considering searches performed over uncompressed text and text compressed with the most well-suited compressors for text databases showed energy savings of around 30–70%. These savings are obtained thanks to improvements in search time but also, rather unexpectedly, because compressed text searches typically require less power from the processor. We have also analyzed how small modifications to the Horspool search algorithm can lead to time savings and can reduce even further the power needs of the processor (up to 5–10%), and, consequently, the overall energy consumption.es_ES
dc.description.sponsorshipThis work was partially funded by MCIN/AEI/10.13039/501100011033 and EU/ ERDF: grant PID2021-122554OB-C33 (OASSIS); by MCIN/AEI/ 10.13039/501100011033 and European Union NextGenerationEU/PRTR: grant TED2021-129245B-C21 (PLAGEMIS). Funding Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. The group from Castilla - La Mancha is funded in part by CECD (JCCM) and FEDER funds: grant SBPLY/21/180501/000115 (EMMA). The group from A Coruña is also funded in part by MCIN/ AEI/10.13039/501100011033 and “NextGenerationEU”/PRTR: grants PDC2021-121239-C31 (FLATCITY-POC), PDC2021-120917-C21 (SIGTRANS), and PID2020-114635RB-I00 (EXTRA-Compact); by GAIN/Xunta de Galicia: grant GRC: ED431C 2021/53; by UE, (ERDF), GAIN, Convocatoria Conecta COVID: grant IN852D 2021/3 (CO3).es_ES
dc.description.sponsorshipXunta de Galicia; ED431C 2021/53es_ES
dc.description.sponsorshipXunta de Galicia; IN852D 2021/3es_ES
dc.description.sponsorshipJunta Castilla-La Mancha; SBPLY/21/180501/000115es_ES
dc.identifier.citationFariña, A., Poy, O., Brisaboa, N.R. et al. Onto how compression yields energy-efficient text search. Computing 107, 122 (2025). https://doi.org/10.1007/s00607-025-01469-0es_ES
dc.identifier.doi10.1007/s00607-025-01469-0
dc.identifier.issn1436-5057
dc.identifier.issn0010-485X
dc.identifier.urihttp://hdl.handle.net/2183/41984
dc.language.isoenges_ES
dc.publisherSpringeres_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-122554OB-C33/ES/OASSIS-UDC: HACIA ORGANIZACIONES SOFTWARE MÁS SOSTENIBLES: UN ENFOQUE HOLÍSTICO PARA PROMOVER LA SOSTENIBILIDAD ECONÓMICA, HUMANA Y MEDIOAMBIENTALes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-129245B-C21/ES/PLATAFORMA PARA LA GENERACIÓN AUTOMÁTICA DE SISTEMAS DE INFORMACIÓN DE LA MOVILIDAD ENERGÉTICAMENTE EFICIENTES, BASADOS EN ESTRUCTURAS DE DATOS COMPACTAS Y GIS (PLAGEMIS)es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PDC2021-121239-C31/ES/FLATCITY-BOARD: BACKEND AND DASHBOARD FOR FLATCITYes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PDC2021-120917-C21/ES/SIGTRANS-UDCes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-114635RB-I00/ES/EXPLOTACION ENRIQUECIDA DE TRAYECTORIAS CON ESTRUCTURAS DE DATOS COMPACTAS Y GIS/es_ES
dc.relation.urihttps://doi.org/10.1007/s00607-025-01469-0es_ES
dc.rightsAtribución 4.0 Internacionales_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es/*
dc.subjectText compressiones_ES
dc.subjectDense codeses_ES
dc.subjectHuffman codeses_ES
dc.subjectText searches_ES
dc.subjectEnergy efficiencyes_ES
dc.subjectGreen softwarees_ES
dc.titleOnto how compression yields energy-efficient text searches_ES
dc.typejournal articlees_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication2fe2b113-791f-4229-a83a-311d0c8b5ce6
relation.isAuthorOfPublication42f2c226-9868-4516-8efd-2cd3c6692034
relation.isAuthorOfPublication21dcfe07-2476-4360-a425-ba1ba4253409
relation.isAuthorOfPublication.latestForDiscovery2fe2b113-791f-4229-a83a-311d0c8b5ce6

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Farina_Antonio_2025_Onto_how_compression_yields_energy_efficient_text_search.pdf
Size:
1.85 MB
Format:
Adobe Portable Document Format
Description: