Better Benchmarking LLMs for Zero-Shot Dependency Parsing

Ezquerro, Ana; Gómez-Rodríguez, Carlos; Vilares, David

Better Benchmarking LLMs for Zero-Shot Dependency Parsing

UDC.coleccion	Investigación	es_ES
UDC.conferenceTitle	NoDaLiDa/Baltic-HLT 2025	es_ES
UDC.departamento	Ciencias da Computación e Tecnoloxías da Información	es_ES
UDC.endPage	135	es_ES
UDC.grupoInv	Lingua e Sociedade da Información (LYS)	es_ES
UDC.institutoCentro	CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación	es_ES
UDC.startPage	121	es_ES
dc.contributor.author	Ezquerro, Ana
dc.contributor.author	Gómez-Rodríguez, Carlos
dc.contributor.author	Vilares, David
dc.date.accessioned	2025-03-03T11:03:13Z
dc.date.available	2025-03-03T11:03:13Z
dc.date.issued	2025-03
dc.description	Presented at: Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), pages 121–135, March 3-4, 2025 ©2025 University of Tartu Library	es_ES
dc.description	Código asociado: https://github.com/anaezquerro/naipar	es_ES
dc.description.abstract	[Abstract]: While LLMs excel in zero-shot tasks, their performance in linguistic challenges like syntactic parsing has been less scrutinized. This paper studies state-of-the-art openweight LLMs on the task by comparing them to baselines that do not have access to the input sentence, including baselines that have not been used in this context such as random projective trees or optimal linear arrangements. The results show that most of the tested LLMs cannot outperform the best uninformed baselines, with only the newest and largest versions of LLaMA doing so for most languages, and still achieving rather low performance. Thus, accurate zero-shot syntactic parsing is not forthcoming with open LLMs.	es_ES
dc.description.sponsorship	We acknowledge grants SCANNER-UDC (PID2020-113230RB-C21) funded by MICIU/AEI/10.13039/501100011033; GAP (PID2022-139308OA-I00) funded by MICIU/AEI/10.13039/501100011033/ and ERDF, EU; LATCHING (PID2023-147129OB-C21) funded by MICIU/AEI/10.13039/501100011033 and ERDF, EU; and TSI-100925-2023-1 funded by Ministry for Digital Transformation and Civil Service and “NextGenerationEU” PRTR; as well as funding by Xunta de Galicia (ED431C 2024/02), and Centro de Investigación de Galicia “CITIC”, funded by the Xunta de Galicia through the collaboration agreement between the Consellería de Cultura, Educación, Formación Profesional e Universidades and the Galician universities for the reinforcement of the research centres of the Galician University System (CIGUS). We also extend our gratitude to CESGA, the supercomputing center of Galicia, for granting us access to its resources.	es_ES
dc.description.sponsorship	Xunta de Galicia; ED431C 2024/02	es_ES
dc.identifier.citation	A. Ezquerro, C. Gómez-Rodríguez, and D. Vilares, "Better Benchmarking LLMs for Zero-Shot Dependency Parsing", Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), University of Tartu Library, pp. 121–135, March 3-4, 2025	es_ES
dc.identifier.other	https://hdl.handle.net/10062/107204
dc.identifier.uri	http://hdl.handle.net/2183/41295
dc.language.iso	eng	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113230RB-C21/ES/MODELOS MULTITAREA DE ETIQUETADO SECUENCIAL PARA EL RECONOCIMIENTO DE ENTIDADES ENRIQUECIDO CON INFORMACIÓN LINGÜÍSTICA: SINTAXIS E INTEGRACIÓN MULTITAREA (SCANNER-UDC)	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-139308OA-100/ES/REPRESENTACIONES ESTRUCTURADAS VERDES Y ENCHUFABLES	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2023-147129OB-C21/ES/TECNOLOGÍAS DEL LENGUAJE DESDE UNA PERSPECTIVA VERDE (LATCHING): DOMINIOS CON ESCASOS RECURSOS	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MTDPF//TSI-100925-2023-1/ES/CÁTEDRA UDC-INDITEX DE IA EN ALGORITMOS VERDES	es_ES
dc.rights	Atribución-NoComercial-SinDerivadas 3.0 España	es_ES
dc.rights.accessRights	open access	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/	*
dc.subject	LLMs	es_ES
dc.subject	Large language models	es_ES
dc.subject	Syntactic parsing	es_ES
dc.title	Better Benchmarking LLMs for Zero-Shot Dependency Parsing	es_ES
dc.type	conference output	es_ES
dspace.entity.type	Publication
relation.isAuthorOfPublication	2f08b56a-af5a-4627-b111-5ccccc33d17d
relation.isAuthorOfPublication	e70a3969-39f6-4458-9339-3b71756fa56e
relation.isAuthorOfPublication	37dabbe9-f54f-43bb-960e-0bf3ac7e54eb
relation.isAuthorOfPublication.latestForDiscovery	2f08b56a-af5a-4627-b111-5ccccc33d17d

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Vilares_David_2025_Better_Benchmarking_LLMs_for_Zero_Shot_Dependency_Parsing.pdf
Size:: 1.38 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

Investigación (FFIL)