Hierarchical Bracketing Encodings for Dependency Parsing as Tagging
| UDC.coleccion | Investigación | |
| UDC.conferenceTitle | 63rd Annual Meeting of the Association for Computational Linguistics | |
| UDC.departamento | Ciencias da Computación e Tecnoloxías da Información | |
| UDC.endPage | 18450 | |
| UDC.grupoInv | Lingua e Sociedade da Información (LYS) | |
| UDC.institutoCentro | CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación | |
| UDC.startPage | 18436 | |
| UDC.volume | 1 | |
| dc.contributor.author | Ezquerro, Ana | |
| dc.contributor.author | Vilares, David | |
| dc.contributor.author | Yli-Jyrä, Anssi | |
| dc.contributor.author | Gómez-Rodríguez, Carlos | |
| dc.date.accessioned | 2025-09-11T14:49:19Z | |
| dc.date.available | 2025-09-11T14:49:19Z | |
| dc.date.issued | 2025-07 | |
| dc.description | Presented at the 63rd Annual Meeting of the Association for Computational Linguistics, July 2025, Vienna, Austria. | |
| dc.description.abstract | [Abstract]: We present a family of encodings for sequence labeling dependency parsing, based on the concept of hierarchical bracketing. We show that the existing 4-bit projective encoding belongs to this family, but it is suboptimal in the number of labels used to encode a tree. We derive an optimal hierarchical bracketing, which minimizes the number of symbols used and encodes projective trees using only 12 distinct labels (vs. 16 for the 4-bit encoding). We also extend optimal hierarchical bracketing to support arbitrary non-projectivity in a more compact way than previous encodings. Our new encodings yield competitive accuracy on a diverse set of treebanks. | |
| dc.description.sponsorship | We acknowledge grants SCANNER-UDC (PID2020-113230RB-C21) funded by MICIU/AEI/10.13039/501100011033; GAP (PID2022-139308OA-I00) funded by MICIU/AEI/10.13039/501100011033/ and ERDF, EU; LATCHING (PID2023-147129OB-C21) funded by MICIU/AEI/10.13039/501100011033 and ERDF, EU; and TSI-100925-2023-1 funded by Ministry for Digital Transformation and Civil Service and “NextGenerationEU” PRTR; as well as funding by Xunta de Galicia (ED431C 2024/02), and CITIC, as a center accredited for excellence within the Galician University System and a member of the CIGUS Network, receives subsidies from the Department of Education, Science, Universities, and Vocational Training of the Xunta de Galicia. Additionally, it is co-financed by the EU through the FEDER Galicia 2021-27 operational program (Ref. ED431G 2023/01). We also extend our gratitude to CESGA, the supercomputing center of Galicia, for granting us access to its resources. Furthermore, we acknowledge the Faculty of Agriculture and Forestry of the University of Helsinki, as well as projects "Theory of Computational Logics" (352420) and "XAILOG" (345612, 345633) funded by the Research Council of Finland for the continued support of the third author during the multistage writing process. | |
| dc.description.sponsorship | Xunta de Galicia; ED431C 2024/02 | |
| dc.description.sponsorship | Xunta de Galicia; ED431G 2023/01 | |
| dc.description.sponsorship | Finland. Research Council of Finland; 352420 | |
| dc.description.sponsorship | Finland. Research Council of Finland; 345612 | |
| dc.description.sponsorship | Finland. Research Council of Finland; 345633 | |
| dc.identifier.citation | Ana Ezquerro, David Vilares, Anssi Yli-Jyrä, and Carlos Gómez-Rodríguez. 2025. Hierarchical Bracketing Encodings for Dependency Parsing as Tagging. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 18436–18450, Vienna, Austria. Association for Computational Linguistics. DOI: 10.18653/v1/2025.acl-long.903 | |
| dc.identifier.doi | 10.18653/v1/2025.acl-long.903 | |
| dc.identifier.isbn | 979-8-89176-251-0 | |
| dc.identifier.uri | https://hdl.handle.net/2183/45744 | |
| dc.language.iso | eng | |
| dc.publisher | Association for Computational Linguistics | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113230RB-C21/ES/MODELOS MULTITAREA DE ETIQUETADO SECUENCIAL PARA EL RECONOCIMIENTO DE ENTIDADES ENRIQUECIDO CON INFORMACION LINGUISTICA: SINTAXIS E INTEGRACION MULTITAREA (SCANNER-UDC)/ | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-139308OA-100/ES/REPRESENTACIONES ESTRUCTURADAS VERDES Y ENCHUFABLES | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2023-147129OB-C21/ES/TECNOLOGÍAS DEL LENGUAJE DESDE UNA PERSPECTIVA VERDE (LATCHING): DOMINIOS CON ESCASOS RECURSOS | |
| dc.relation.projectID | info:eu-repo/grantAgreement/MTDPF//TSI-100925-2023-1/ES/CÁTEDRA UDC-INDITEX DE IA EN ALGORITMOS VERDES | |
| dc.relation.uri | https://doi.org/10.18653/v1/2025.acl-long.903 | |
| dc.rights | Attribution 4.0 International | en |
| dc.rights.accessRights | open access | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Dependency parsing | |
| dc.subject | Tagging | |
| dc.subject | Encodings | |
| dc.title | Hierarchical Bracketing Encodings for Dependency Parsing as Tagging | |
| dc.type | conference output | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 2f08b56a-af5a-4627-b111-5ccccc33d17d | |
| relation.isAuthorOfPublication | 37dabbe9-f54f-43bb-960e-0bf3ac7e54eb | |
| relation.isAuthorOfPublication | e70a3969-39f6-4458-9339-3b71756fa56e | |
| relation.isAuthorOfPublication.latestForDiscovery | 2f08b56a-af5a-4627-b111-5ccccc33d17d |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Vilares_David_2025_Hierarchical_Bracketing_Encodings_for_Dependency_Parsing_as_Tagging.pdf
- Size:
- 287.19 KB
- Format:
- Adobe Portable Document Format

