Better Benchmarking LLMs for Zero-Shot Dependency Parsing
| UDC.coleccion | Investigación | es_ES |
| UDC.conferenceTitle | NoDaLiDa/Baltic-HLT 2025 | es_ES |
| UDC.departamento | Ciencias da Computación e Tecnoloxías da Información | es_ES |
| UDC.endPage | 135 | es_ES |
| UDC.grupoInv | Lingua e Sociedade da Información (LYS) | es_ES |
| UDC.institutoCentro | CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación | es_ES |
| UDC.startPage | 121 | es_ES |
| dc.contributor.author | Ezquerro, Ana | |
| dc.contributor.author | Gómez-Rodríguez, Carlos | |
| dc.contributor.author | Vilares, David | |
| dc.date.accessioned | 2025-03-03T11:03:13Z | |
| dc.date.available | 2025-03-03T11:03:13Z | |
| dc.date.issued | 2025-03 | |
| dc.description | Presented at: Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), pages 121–135, March 3-4, 2025 ©2025 University of Tartu Library | es_ES |
| dc.description | Código asociado: https://github.com/anaezquerro/naipar | es_ES |
| dc.description.abstract | [Abstract]: While LLMs excel in zero-shot tasks, their performance in linguistic challenges like syntactic parsing has been less scrutinized. This paper studies state-of-the-art openweight LLMs on the task by comparing them to baselines that do not have access to the input sentence, including baselines that have not been used in this context such as random projective trees or optimal linear arrangements. The results show that most of the tested LLMs cannot outperform the best uninformed baselines, with only the newest and largest versions of LLaMA doing so for most languages, and still achieving rather low performance. Thus, accurate zero-shot syntactic parsing is not forthcoming with open LLMs. | es_ES |
| dc.description.sponsorship | We acknowledge grants SCANNER-UDC (PID2020-113230RB-C21) funded by MICIU/AEI/10.13039/501100011033; GAP (PID2022-139308OA-I00) funded by MICIU/AEI/10.13039/501100011033/ and ERDF, EU; LATCHING (PID2023-147129OB-C21) funded by MICIU/AEI/10.13039/501100011033 and ERDF, EU; and TSI-100925-2023-1 funded by Ministry for Digital Transformation and Civil Service and “NextGenerationEU” PRTR; as well as funding by Xunta de Galicia (ED431C 2024/02), and Centro de Investigación de Galicia “CITIC”, funded by the Xunta de Galicia through the collaboration agreement between the Consellería de Cultura, Educación, Formación Profesional e Universidades and the Galician universities for the reinforcement of the research centres of the Galician University System (CIGUS). We also extend our gratitude to CESGA, the supercomputing center of Galicia, for granting us access to its resources. | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431C 2024/02 | es_ES |
| dc.identifier.citation | A. Ezquerro, C. Gómez-Rodríguez, and D. Vilares, "Better Benchmarking LLMs for Zero-Shot Dependency Parsing", Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), University of Tartu Library, pp. 121–135, March 3-4, 2025 | es_ES |
| dc.identifier.other | https://hdl.handle.net/10062/107204 | |
| dc.identifier.uri | http://hdl.handle.net/2183/41295 | |
| dc.language.iso | eng | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113230RB-C21/ES/MODELOS MULTITAREA DE ETIQUETADO SECUENCIAL PARA EL RECONOCIMIENTO DE ENTIDADES ENRIQUECIDO CON INFORMACIÓN LINGÜÍSTICA: SINTAXIS E INTEGRACIÓN MULTITAREA (SCANNER-UDC) | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-139308OA-100/ES/REPRESENTACIONES ESTRUCTURADAS VERDES Y ENCHUFABLES | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2023-147129OB-C21/ES/TECNOLOGÍAS DEL LENGUAJE DESDE UNA PERSPECTIVA VERDE (LATCHING): DOMINIOS CON ESCASOS RECURSOS | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/MTDPF//TSI-100925-2023-1/ES/CÁTEDRA UDC-INDITEX DE IA EN ALGORITMOS VERDES | es_ES |
| dc.rights | Atribución-NoComercial-SinDerivadas 3.0 España | es_ES |
| dc.rights.accessRights | open access | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | * |
| dc.subject | LLMs | es_ES |
| dc.subject | Large language models | es_ES |
| dc.subject | Syntactic parsing | es_ES |
| dc.title | Better Benchmarking LLMs for Zero-Shot Dependency Parsing | es_ES |
| dc.type | conference output | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 2f08b56a-af5a-4627-b111-5ccccc33d17d | |
| relation.isAuthorOfPublication | e70a3969-39f6-4458-9339-3b71756fa56e | |
| relation.isAuthorOfPublication | 37dabbe9-f54f-43bb-960e-0bf3ac7e54eb | |
| relation.isAuthorOfPublication.latestForDiscovery | 2f08b56a-af5a-4627-b111-5ccccc33d17d |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Vilares_David_2025_Better_Benchmarking_LLMs_for_Zero_Shot_Dependency_Parsing.pdf
- Size:
- 1.38 MB
- Format:
- Adobe Portable Document Format
- Description:

