[Authors]: Bao, Eliseo; Pérez, Anxo; Otero, David; Parapar, Javier
[Dates]: 2025-11-05; 2025-11-05; 2025-10-17
[Citation]: E. Bao, A. Pérez, D. Otero, and J. Parapar, "How does depression talk on social media? Modeling depression language with relevance-based statistical language models", Online Social Networks and Media, vol. 50, p. 100339, Dec. 2025, doi: 10.1016/j.osnem.2025.100339
[ISSN]: 2468-6964
[URI]: https://hdl.handle.net/2183/46281
[Abstract]: Many individuals with mental health problems turn to the internet and social media for information and support. The text generated on these platforms serves as a valuable resource for identifying mental health risks, driving interdisciplinary research to develop models for mental health analysis and prediction. In this paper, we model depression-related language using relevance-based statistical language models to create lexicons that characterize linguistic patterns associated with depression. We also propose a ranking method that leverages these lexicons to prioritize users exhibiting stronger signs of depressive language on social media. Our models integrate clinical markers from established depression questionnaires, particularly the Beck Depression Inventory-II (BDI-II), enhancing explainability, generalization, and performance. Experiments across multiple social media datasets show that incorporating clinical knowledge improves user ranking and generalizes effectively across platforms. Additionally, we refine existing depression lexicons by applying weights estimated from our models, achieving better performance in generating depression-related queries. A comparative analysis of our models highlights differences in language use between control users and those with depression, aligning with prior psycholinguistic findings.
This work advances the understanding of depression-related language through statistical modeling, paving the way for scalable social media interventions to identify at-risk individuals.
[Language]: eng
[Rights]: © 2025 The Authors
[License]: Attribution 4.0 International, http://creativecommons.org/licenses/by/4.0/
[Keywords]: Mental health; Depression; Language modeling; Natural language processing; Text mining; Social media; User risk assessment; Clinical markers; Linguistic patterns; Psycholinguistics
[Title]: How does depression talk on social media? Modeling depression language with relevance-based statistical language models
[Type]: journal article
[Access]: open access
[DOI]: 10.1016/j.osnem.2025.100339