A Syntax-Injected Approach for Faster and More Accurate Sentiment Analysis

Imran, Muhammad; Kellert, Olga; Gómez-Rodríguez, Carlos

A Syntax-Injected Approach for Faster and More Accurate Sentiment Analysis

UDC.coleccion	Investigación
UDC.departamento	Ciencias da Computación e Tecnoloxías da Información
UDC.grupoInv	Lingua e Sociedade da Información (LYS)
UDC.institutoCentro	CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación
UDC.journalTitle	PeerJ Computer Science
UDC.startPage	e3519
UDC.volume	12
dc.contributor.author	Imran, Muhammad
dc.contributor.author	Kellert, Olga
dc.contributor.author	Gómez-Rodríguez, Carlos
dc.date.accessioned	2026-04-13T09:44:29Z
dc.date.available	2026-04-13T09:44:29Z
dc.date.issued	2026-01-30
dc.description	The following information was supplied regarding data availability: The OpeNERen, OpeNERes, UD_English-EWT and UD_Spanish-AnCora datasets are available at Zenodo: Muhammad Imran. (2025). chimran135/syntax-injected-sentimentanalysis: Syntax-injected Sentiment Analysis (1.0.0). Zenodo. https://doi.org/10.5281/zenodo.15323755. The Rest-Mex 2023 shared task website provides access to the training data for registered participants. The data are held privately for assessment purposes and can be obtained from Rest-Mex 2023 organizers https://sites.google.com/cimat.mx/rest-mex2023 by contacting Miguel Ángel Álvarez Carmona at miguel.alvarez@cimat.mx. The Rest-Mex 2023 dataset was used under license for this study.
dc.description.abstract	[Abstract]: Sentiment Analysis (SA) is a crucial aspect of Natural Language Processing (NLP), focusing on identifying and interpreting subjective assessments in textual content. Syntactic parsing is useful in SA as it improves accuracy and provides explainability; however, it often becomes a computational bottleneck due to slow parsing algorithms. This article proposes a solution to this bottleneck by using a Sequence Labeling Syntactic Parser (SELSP) to integrate syntactic information into SA via a rule-based sentiment analysis pipeline. By reformulating dependency parsing as a sequence labeling task, we significantly improve the efficiency of syntax-based SA. SELSP is trained and evaluated on a ternary polarity classification task, demonstrating greater speed and accuracy compared to conventional parsers like Stanza and heuristic approaches such as Valence Aware Dictionary and sEntiment Reasoner (VADER). The combination of speed and accuracy makes SELSP especially attractive for sentiment analysis applications in both academic and industrial contexts. Moreover, we compare SELSP with Transformer-based models trained on a 5-label classification task. In addition, we evaluate multiple sentiment dictionaries with SELSP to determine which yields the best performance in polarity prediction. The results show that dictionaries accounting for polarity judgment variation outperform those that ignore it. Furthermore, we show that SELSP outperforms Transformer-based models in terms of speed for polarity prediction.
dc.description.sponsorship	The European Research Council (ERC) funded this research under the Horizon Europe research and innovation programme (SALSA, grant agreement No. 101100615), SCANNER-UDC (PID2020-113230RB-C21) funded by MICIU/AEI/10.13039/501100011033, LATCHING (PID2023-147129OB-C21) funded by MICIU/AEI/10.13039/501100011033 and ERDF (EU), project GAP (PID2022-139308OA-I00) funded by MICIU/AEI/10.13039/501100011033/ and ERDF (EU), Ministry for Digital Transformation and Civil Service and “NextGenerationEU” PRTR under grant TSI-100925-2023-1, Xunta de Galicia (ED431C 2024/02), and Galician Research Center “CITIC”, funded by Xunta de Galicia through the collaboration agreement between the Consellería de Cultura, Educación, Formación Profesional e Universidades and the Galician universities for the reinforcement of the research centres of the Galician University System (CIGUS). Furthermore, this research was supported by the International, Interdisciplinary and Intersectoral Information and Communications Technology PhD programme (3-i ICT) granted to CITIC and supported by the European Union through the Horizon 2020 research and innovation programme under a Marie Skłodowska-Curie agreement (H2020-MSCA-COFUND), GA 101034261. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
dc.description.sponsorship	Xunta de Galicia; ED431C 2024/02
dc.identifier.citation	Imran M, Kellert O, Gómez-Rodríguez C. 2026. A syntax-injected approach for faster and more accurate sentiment analysis. PeerJ Computer Science 12:e3519 https://doi.org/10.7717/peerj-cs.3519
dc.identifier.doi	10.7717/peerj-cs.3519
dc.identifier.issn	2376-5992
dc.identifier.uri	https://hdl.handle.net/2183/47940
dc.language.iso	eng
dc.publisher	PeerJ
dc.relation.isbasedon	https://doi.org/10.5281/zenodo.15323755
dc.relation.projectID	info:eu-repo/grantAgreement/EC/H2020/101034261
dc.relation.projectID	info:eu-repo/grantAgreement/EC/HE/101100615
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113230RB-C21/ES/MODELOS MULTITAREA DE ETIQUETADO SECUENCIAL PARA EL RECONOCIMIENTO DE ENTIDADES ENRIQUECIDO CON INFORMACION LINGUISTICA: SINTAXIS E INTEGRACION MULTITAREA (SCANNER-UDC)
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2023-147129OB-C21/ES/TECNOLOGÍAS DEL LENGUAJE DESDE UNA PERSPECTIVA VERDE (LATCHING): DOMINIOS CON ESCASOS RECURSOS
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-139308OA-100/ES/REPRESENTACIONES ESTRUCTURADAS VERDES Y ENCHUFABLES
dc.relation.projectID	info:eu-repo/grantAgreement/MTDPF//TSI-100925-2023-1/ES/CÁTEDRA UDC-INDITEX DE IA EN ALGORITMOS VERDES
dc.relation.uri	https://doi.org/10.7717/peerj-cs.3519
dc.rights	Attribution 4.0 International	en
dc.rights.accessRights	open access
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject	Sentiment analysis
dc.subject	Sequence labeling parsing
dc.subject	Syntactic knowledge
dc.subject	Opinion mining
dc.title	A Syntax-Injected Approach for Faster and More Accurate Sentiment Analysis
dc.type	journal article
dc.type.hasVersion	VoR
dspace.entity.type	Publication
relation.isAuthorOfPublication	6779b734-3d4b-4242-9bde-78e83eea84db
relation.isAuthorOfPublication	da9e8872-ab78-4a1c-8212-1121388beb43
relation.isAuthorOfPublication	e70a3969-39f6-4458-9339-3b71756fa56e
relation.isAuthorOfPublication.latestForDiscovery	6779b734-3d4b-4242-9bde-78e83eea84db

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Imran_Muhammad_2026_A_syntax_injected_approach.pdf
Size:: 8.39 MB
Format:: Adobe Portable Document Format

Download

Collections

Investigación (FFIL)