A Syntax-Injected Approach for Faster and More Accurate Sentiment Analysis
| UDC.coleccion | Investigación | |
| UDC.departamento | Ciencias da Computación e Tecnoloxías da Información | |
| UDC.grupoInv | Lingua e Sociedade da Información (LYS) | |
| UDC.institutoCentro | CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación | |
| UDC.journalTitle | PeerJ Computer Science | |
| UDC.startPage | e3519 | |
| UDC.volume | 12 | |
| dc.contributor.author | Imran, Muhammad | |
| dc.contributor.author | Kellert, Olga | |
| dc.contributor.author | Gómez-Rodríguez, Carlos | |
| dc.date.accessioned | 2026-04-13T09:44:29Z | |
| dc.date.available | 2026-04-13T09:44:29Z | |
| dc.date.issued | 2026-01-30 | |
| dc.description | The following information was supplied regarding data availability: The OpeNERen, OpeNERes, UD_English-EWT and UD_Spanish-AnCora datasets are available at Zenodo: Muhammad Imran. (2025). chimran135/syntax-injected-sentimentanalysis: Syntax-injected Sentiment Analysis (1.0.0). Zenodo. https://doi.org/10.5281/zenodo.15323755. The Rest-Mex 2023 shared task website provides access to the training data for registered participants. The data are held privately for assessment purposes and can be obtained from Rest-Mex 2023 organizers https://sites.google.com/cimat.mx/rest-mex2023 by contacting Miguel Ángel Álvarez Carmona at miguel.alvarez@cimat.mx. The Rest-Mex 2023 dataset was used under license for this study. | |
| dc.description.abstract | [Abstract]: Sentiment Analysis (SA) is a crucial aspect of Natural Language Processing (NLP), focusing on identifying and interpreting subjective assessments in textual content. Syntactic parsing is useful in SA as it improves accuracy and provides explainability; however, it often becomes a computational bottleneck due to slow parsing algorithms. This article proposes a solution to this bottleneck by using a Sequence Labeling Syntactic Parser (SELSP) to integrate syntactic information into SA via a rule-based sentiment analysis pipeline. By reformulating dependency parsing as a sequence labeling task, we significantly improve the efficiency of syntax-based SA. SELSP is trained and evaluated on a ternary polarity classification task, demonstrating greater speed and accuracy compared to conventional parsers like Stanza and heuristic approaches such as Valence Aware Dictionary and sEntiment Reasoner (VADER). The combination of speed and accuracy makes SELSP especially attractive for sentiment analysis applications in both academic and industrial contexts. Moreover, we compare SELSP with Transformer-based models trained on a 5-label classification task. In addition, we evaluate multiple sentiment dictionaries with SELSP to determine which yields the best performance in polarity prediction. The results show that dictionaries accounting for polarity judgment variation outperform those that ignore it. Furthermore, we show that SELSP outperforms Transformer-based models in terms of speed for polarity prediction. | |
| dc.description.sponsorship | The European Research Council (ERC) funded this research under the Horizon Europe research and innovation programme (SALSA, grant agreement No. 101100615), SCANNER-UDC (PID2020-113230RB-C21) funded by MICIU/AEI/10.13039/501100011033, LATCHING (PID2023-147129OB-C21) funded by MICIU/AEI/10.13039/501100011033 and ERDF (EU), project GAP (PID2022-139308OA-I00) funded by MICIU/AEI/10.13039/501100011033/ and ERDF (EU), Ministry for Digital Transformation and Civil Service and “NextGenerationEU” PRTR under grant TSI-100925-2023-1, Xunta de Galicia (ED431C 2024/02), and Galician Research Center “CITIC”, funded by Xunta de Galicia through the collaboration agreement between the Consellería de Cultura, Educación, Formación Profesional e Universidades and the Galician universities for the reinforcement of the research centres of the Galician University System (CIGUS). Furthermore, this research was supported by the International, Interdisciplinary and Intersectoral Information and Communications Technology PhD programme (3-i ICT) granted to CITIC and supported by the European Union through the Horizon 2020 research and innovation programme under a Marie Skłodowska-Curie agreement (H2020-MSCA-COFUND), GA 101034261. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. | |
| dc.description.sponsorship | Xunta de Galicia; ED431C 2024/02 | |
| dc.identifier.citation | Imran M, Kellert O, Gómez-Rodríguez C. 2026. A syntax-injected approach for faster and more accurate sentiment analysis. PeerJ Computer Science 12:e3519 https://doi.org/10.7717/peerj-cs.3519 | |
| dc.identifier.doi | 10.7717/peerj-cs.3519 | |
| dc.identifier.issn | 2376-5992 | |
| dc.identifier.uri | https://hdl.handle.net/2183/47940 | |
| dc.language.iso | eng | |
| dc.publisher | PeerJ | |
| dc.relation.isbasedon | https://doi.org/10.5281/zenodo.15323755 | |
| dc.relation.projectID | info:eu-repo/grantAgreement/EC/H2020/101034261 | |
| dc.relation.projectID | info:eu-repo/grantAgreement/EC/HE/101100615 | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113230RB-C21/ES/MODELOS MULTITAREA DE ETIQUETADO SECUENCIAL PARA EL RECONOCIMIENTO DE ENTIDADES ENRIQUECIDO CON INFORMACION LINGUISTICA: SINTAXIS E INTEGRACION MULTITAREA (SCANNER-UDC) | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2023-147129OB-C21/ES/TECNOLOGÍAS DEL LENGUAJE DESDE UNA PERSPECTIVA VERDE (LATCHING): DOMINIOS CON ESCASOS RECURSOS | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-139308OA-100/ES/REPRESENTACIONES ESTRUCTURADAS VERDES Y ENCHUFABLES | |
| dc.relation.projectID | info:eu-repo/grantAgreement/MTDPF//TSI-100925-2023-1/ES/CÁTEDRA UDC-INDITEX DE IA EN ALGORITMOS VERDES | |
| dc.relation.uri | https://doi.org/10.7717/peerj-cs.3519 | |
| dc.rights | Attribution 4.0 International | en |
| dc.rights.accessRights | open access | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Sentiment analysis | |
| dc.subject | Sequence labeling parsing | |
| dc.subject | Syntactic knowledge | |
| dc.subject | Opinion mining | |
| dc.title | A Syntax-Injected Approach for Faster and More Accurate Sentiment Analysis | |
| dc.type | journal article | |
| dc.type.hasVersion | VoR | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 6779b734-3d4b-4242-9bde-78e83eea84db | |
| relation.isAuthorOfPublication | da9e8872-ab78-4a1c-8212-1121388beb43 | |
| relation.isAuthorOfPublication | e70a3969-39f6-4458-9339-3b71756fa56e | |
| relation.isAuthorOfPublication.latestForDiscovery | 6779b734-3d4b-4242-9bde-78e83eea84db |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Imran_Muhammad_2026_A_syntax_injected_approach.pdf
- Size:
- 8.39 MB
- Format:
- Adobe Portable Document Format

