UzbekVerbDetection: Rule-based Detection of Verbs in Uzbek Texts
Use este enlace para citar
http://hdl.handle.net/2183/38024
Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución-NoComercial 4.0 Internacional (CC-BY-NC 4.0)
Colecciones
- Investigación (FFIL) [816]
Metadatos
Mostrar el registro completo del ítemTítulo
UzbekVerbDetection: Rule-based Detection of Verbs in Uzbek TextsFecha
2024Cita bibliográfica
Maksud Sharipov, Elmurod Kuriyozov, Ollabergan Yuldashev, and Ogabek Sobirov. 2024. UzbekVerbDetection: Rule-based Detection of Verbs in Uzbek Texts. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 17343–17347, Torino, Italia. ELRA and ICCL.
Resumen
[Abstract]: Verb detection is a fundamental task in natural language processing that involves identifying the action or state expressed by a verb in a sentence. However, in Uzbek language, verb detection is challenging due to the complexity of its morphology and the agglutinative nature of the language. In this paper, we propose a rule-based approach for verb detection in Uzbek texts based on affixes/suffixes. Our method is based on a set of rules that capture the morphological patterns of verb forms in Uzbek language. We evaluate the proposed approach on a dataset of Uzbek texts and report an F1-score of 0.97, which outperforms existing methods for verb detection in Uzbek language. Our results suggest that rule-based approaches can be effective for verb detection in Uzbek texts and have potential applications in various natural language processing tasks.
Palabras clave
Verbs
Affixes
Suffixes
FSM
Rule-based
NLP
Uzbek NLP
Verb detection
Affixes
Suffixes
FSM
Rule-based
NLP
Uzbek NLP
Verb detection
Descripción
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) held in Torino (Italia)
20-25 May, 2024.
Versión del editor
Derechos
Atribución-NoComercial 4.0 Internacional (CC-BY-NC 4.0)
ISBN
978-249381410-4