DepreSym: A Depression Symptom Annotated Corpus and the Role of Large Language Models as Assessors of Psychological Markers

UDC.coleccionInvestigación
UDC.departamentoCiencias da Computación e Tecnoloxías da Información
UDC.endPage2762
UDC.grupoInvInformation Retrieval Lab (IRlab)
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación
UDC.journalTitleLanguage Resources and Evaluation
UDC.startPage2737
UDC.volume59
dc.contributor.authorPérez, Anxo
dc.contributor.authorFernández-Pichel; Marcos
dc.contributor.authorParapar, Javier
dc.contributor.authorLosada, David E.
dc.date.accessioned2025-10-09T09:12:26Z
dc.date.available2025-10-09T09:12:26Z
dc.date.issued2025-09
dc.descriptionDepreSym can be obtained upon request, and we have adhered to strict ethical standards, particularly those about the ethical development of AI. Financiado para publicación en acceso aberto: CRUE/CSIC con Springer
dc.description.abstract[Abstract]: Computational methods for depression detection aim to mine traces of depression from online publications posted by Internet users. However, solutions trained on existing collections exhibit limited generalisation and interpretability. To tackle these issues, recent studies have shown that identifying specific depressive symptoms can lead to more robust and effective models. The eRisk initiative fosters research on this area and has recently proposed a new ranking task focused on developing search methods to find sentences related to depressive symptoms. This search challenge relies on the symptoms specified by the Beck Depression Inventory-II (BDI-II), a questionnaire widely used in clinical practice. It includes symptoms such as sadness, irritability or lack of sleep. Given the input submitted by systems participating in eRisk, we first apply top-k pooling over the systems’ relevance rankings, obtaining a diverse set of sentences. These sentences are judged for relevance, leading to DepreSym, a dataset consisting of 21,580 sentences annotated according to their relevance to the 21 BDI-II symptoms. This dataset serves as a valuable resource for advancing the development of models that monitor depression markers. Due to the complex nature of this relevance annotation, we designed a robust assessment methodology carried out by three expert assessors, including a trained psychologist. As part of this study, we explore the potential of recent Large Language Models (ChatGPT, GPT4 and Vicuna) as assessors in this complex task. We undertake a comprehensive examination of the LLMs’ performance, studying their main limitations and analysing their role as a complement or replacement for human annotators. Finally, we incorporate our dataset into the Benchmarking Information Retrieval (BEIR) framework for a thorough search evaluation. We use state-of-the-art retrieval systems, including lexical, sparse, dense and re-ranking architectures, to gain insights about the dataset’s complexity and identify potential avenues for improvement.
dc.description.sponsorshipOpen Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. The first and third authors has received support from projects: PLEC2021-007662 (MCIN/AEI/10.13039/501100011033 Ministerio de Ciencia e Innovación, European Union Next Generation EU/PRTR) and also PID2022-137061OB-C21 (MCIN/AEI/10.13039/501100011033, Ministerio de Ciencia e Innovación). We also thank the financial support supplied by the Consellería de Cultura, Educación, Formación Profesional e Universidades (accreditation 2019-2022 ED431G/01, ED431B 2022/33) and the European Regional Development Fund, which acknowledges the CITIC Research Center. The second and fourth authors thank the financial support supplied by MICIU/AEI/10.13039/501100011033 (PID2022-137061OB-C22, supported by ERDF) and Xunta de Galicia-Consellería de Cultura, Educación, Formación Profesional e Universidades (ED431G 2023/04, ED431C 2022/19, supported by ERDF).
dc.description.sponsorshipXunta de Galicia; ED431G/01 2019/2022
dc.description.sponsorshipXunta de Galicia; ED431B 2022/33
dc.description.sponsorshipXunta de Galicia; ED431G 2023/04
dc.description.sponsorshipXunta de Galicia; ED431C 2022/19
dc.identifier.citationPérez, A., Fernández-Pichel, M., Parapar, J. et al. DepreSym: A Depression Symptom Annotated Corpus and the Role of Large Language Models as Assessors of Psychological Markers. Lang Resources & Evaluation 59, 2737–2762 (2025). https://doi.org/10.1007/s10579-025-09831-6
dc.identifier.doi10.1007/s10579-025-09831-6
dc.identifier.issn1574-0218
dc.identifier.urihttps://hdl.handle.net/2183/45934
dc.language.isoeng
dc.publisherSpringer Nature
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PLEC2021-007662/ES/BIG-eRISK: PREDICCIÓN TEMPRANA DE RIESGOS PERSONALES EN CONJUNTOS DE DATOS MASIVOS
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-137061OB-C21/ES/BUSQUEDA, SELECCION Y ORGANIZACION DE CONTENIDOS PARA NECESIDADES DE INFORMACION RELACIONADAS CON LA SALUD - CONSTRUCCION DE RECURSOS Y PERSONALIZACION
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-137061OB-C22/ES/BUSQUEDA, SELECCION Y ORGANIZACION DE CONTENIDOS PARA NECESIDADES DE INFORMACION RELACIONADAS CON LA SALUD: BUSQUEDA Y DETECCION DE DESINFORMACIION
dc.relation.urihttps://doi.org/10.1007/s10579-025-09831-6
dc.rightsAttribution 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectDepression
dc.subjectSocial media mining
dc.subjectLarge language models
dc.subjectSearch
dc.subjectInformation retrieval
dc.titleDepreSym: A Depression Symptom Annotated Corpus and the Role of Large Language Models as Assessors of Psychological Markers
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublicationc673c8b1-1afc-48f6-85e9-8f29f9cffb91
relation.isAuthorOfPublicationfef1a9cb-e346-4e53-9811-192e144f09d0
relation.isAuthorOfPublication.latestForDiscoveryc673c8b1-1afc-48f6-85e9-8f29f9cffb91

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Perez_Anxo_2025_DepreSym.pdf
Size:
767.03 KB
Format:
Adobe Portable Document Format