A Syntax-Injected Approach for Faster and More Accurate Sentiment Analysis

UDC.coleccionInvestigación
UDC.departamentoCiencias da Computación e Tecnoloxías da Información
UDC.grupoInvLingua e Sociedade da Información (LYS)
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación
UDC.journalTitlePeerJ Computer Science
UDC.startPagee3519
UDC.volume12
dc.contributor.authorImran, Muhammad
dc.contributor.authorKellert, Olga
dc.contributor.authorGómez-Rodríguez, Carlos
dc.date.accessioned2026-04-13T09:44:29Z
dc.date.available2026-04-13T09:44:29Z
dc.date.issued2026-01-30
dc.descriptionThe following information was supplied regarding data availability: The OpeNERen, OpeNERes, UD_English-EWT and UD_Spanish-AnCora datasets are available at Zenodo: Muhammad Imran. (2025). chimran135/syntax-injected-sentimentanalysis: Syntax-injected Sentiment Analysis (1.0.0). Zenodo. https://doi.org/10.5281/zenodo.15323755. The Rest-Mex 2023 shared task website provides access to the training data for registered participants. The data are held privately for assessment purposes and can be obtained from Rest-Mex 2023 organizers https://sites.google.com/cimat.mx/rest-mex2023 by contacting Miguel Ángel Álvarez Carmona at miguel.alvarez@cimat.mx. The Rest-Mex 2023 dataset was used under license for this study.
dc.description.abstract[Abstract]: Sentiment Analysis (SA) is a crucial aspect of Natural Language Processing (NLP), focusing on identifying and interpreting subjective assessments in textual content. Syntactic parsing is useful in SA as it improves accuracy and provides explainability; however, it often becomes a computational bottleneck due to slow parsing algorithms. This article proposes a solution to this bottleneck by using a Sequence Labeling Syntactic Parser (SELSP) to integrate syntactic information into SA via a rule-based sentiment analysis pipeline. By reformulating dependency parsing as a sequence labeling task, we significantly improve the efficiency of syntax-based SA. SELSP is trained and evaluated on a ternary polarity classification task, demonstrating greater speed and accuracy compared to conventional parsers like Stanza and heuristic approaches such as Valence Aware Dictionary and sEntiment Reasoner (VADER). The combination of speed and accuracy makes SELSP especially attractive for sentiment analysis applications in both academic and industrial contexts. Moreover, we compare SELSP with Transformer-based models trained on a 5-label classification task. In addition, we evaluate multiple sentiment dictionaries with SELSP to determine which yields the best performance in polarity prediction. The results show that dictionaries accounting for polarity judgment variation outperform those that ignore it. Furthermore, we show that SELSP outperforms Transformer-based models in terms of speed for polarity prediction.
dc.description.sponsorshipThe European Research Council (ERC) funded this research under the Horizon Europe research and innovation programme (SALSA, grant agreement No. 101100615), SCANNER-UDC (PID2020-113230RB-C21) funded by MICIU/AEI/10.13039/501100011033, LATCHING (PID2023-147129OB-C21) funded by MICIU/AEI/10.13039/501100011033 and ERDF (EU), project GAP (PID2022-139308OA-I00) funded by MICIU/AEI/10.13039/501100011033/ and ERDF (EU), Ministry for Digital Transformation and Civil Service and “NextGenerationEU” PRTR under grant TSI-100925-2023-1, Xunta de Galicia (ED431C 2024/02), and Galician Research Center “CITIC”, funded by Xunta de Galicia through the collaboration agreement between the Consellería de Cultura, Educación, Formación Profesional e Universidades and the Galician universities for the reinforcement of the research centres of the Galician University System (CIGUS). Furthermore, this research was supported by the International, Interdisciplinary and Intersectoral Information and Communications Technology PhD programme (3-i ICT) granted to CITIC and supported by the European Union through the Horizon 2020 research and innovation programme under a Marie Skłodowska-Curie agreement (H2020-MSCA-COFUND), GA 101034261. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
dc.description.sponsorshipXunta de Galicia; ED431C 2024/02
dc.identifier.citationImran M, Kellert O, Gómez-Rodríguez C. 2026. A syntax-injected approach for faster and more accurate sentiment analysis. PeerJ Computer Science 12:e3519 https://doi.org/10.7717/peerj-cs.3519
dc.identifier.doi10.7717/peerj-cs.3519
dc.identifier.issn2376-5992
dc.identifier.urihttps://hdl.handle.net/2183/47940
dc.language.isoeng
dc.publisherPeerJ
dc.relation.isbasedonhttps://doi.org/10.5281/zenodo.15323755
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/H2020/101034261
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/HE/101100615
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113230RB-C21/ES/MODELOS MULTITAREA DE ETIQUETADO SECUENCIAL PARA EL RECONOCIMIENTO DE ENTIDADES ENRIQUECIDO CON INFORMACION LINGUISTICA: SINTAXIS E INTEGRACION MULTITAREA (SCANNER-UDC)
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2023-147129OB-C21/ES/TECNOLOGÍAS DEL LENGUAJE DESDE UNA PERSPECTIVA VERDE (LATCHING): DOMINIOS CON ESCASOS RECURSOS
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-139308OA-100/ES/REPRESENTACIONES ESTRUCTURADAS VERDES Y ENCHUFABLES
dc.relation.projectIDinfo:eu-repo/grantAgreement/MTDPF//TSI-100925-2023-1/ES/CÁTEDRA UDC-INDITEX DE IA EN ALGORITMOS VERDES
dc.relation.urihttps://doi.org/10.7717/peerj-cs.3519
dc.rightsAttribution 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectSentiment analysis
dc.subjectSequence labeling parsing
dc.subjectSyntactic knowledge
dc.subjectOpinion mining
dc.titleA Syntax-Injected Approach for Faster and More Accurate Sentiment Analysis
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublication6779b734-3d4b-4242-9bde-78e83eea84db
relation.isAuthorOfPublicationda9e8872-ab78-4a1c-8212-1121388beb43
relation.isAuthorOfPublicatione70a3969-39f6-4458-9339-3b71756fa56e
relation.isAuthorOfPublication.latestForDiscovery6779b734-3d4b-4242-9bde-78e83eea84db

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Imran_Muhammad_2026_A_syntax_injected_approach.pdf
Size:
8.39 MB
Format:
Adobe Portable Document Format