Mostrar o rexistro simple do ítem
Formal methods of tokenization for part-of-speech tagging
dc.contributor.author | Graña Gil, Jorge | |
dc.contributor.author | Barcala Rodríguez, Francisco Mario | |
dc.contributor.author | Vilares Ferro, Manuel | |
dc.date.accessioned | 2005-10-28T15:43:37Z | |
dc.date.available | 2005-10-28T15:43:37Z | |
dc.date.issued | 2002 | |
dc.identifier.citation | Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing (CICLING-2002), Ciudad de Méjico (Méjico). Published in Lecture Notes in Computer Science, vol. 2276, pp. 240-249. Springer Verlag. Gelbukh, A. (ed.). | es_ES |
dc.identifier.isbn | 3-540-43219-1 | |
dc.identifier.issn | 0302-9743 | |
dc.identifier.uri | http://hdl.handle.net/2183/148 | |
dc.description.abstract | [Abstract] One of the most important prior tasks for robust part-of-speech tagging is the correct tokenization or segmentation of the texts. This task can involve processes which are much more complex than the simple identification of the diferent sentences in the text and each of their individual components, but it is often obviated in many current applications. Nevertheless, this preprocessing step is an indispensable task in practice, and it is particularly dificult to tackle it with scientific precision with-out falling repeatedly in the analysis of the specific casuistry of every phenomenon detected. In this work, we have developed a scheme of preprocessing oriented towards the disambiguation and robust tagging of Galician. Nevertheless, it is a proposal of a general architecture that can be applied to other languages, such as Spanish, with very slight modifications. | es_ES |
dc.description.sponsorship | European Commission; 1FD97-0047-C04-02 | es_ES |
dc.description.sponsorship | Xunta de Galicia; PGIDT99XI10502B | |
dc.description.sponsorship | Ministerio de Educación y Ciencia; TIC2000-0370-C02-01 | |
dc.format.mimetype | application/postscript | |
dc.format.mimetype | text/plain | |
dc.language.iso | eng | es_ES |
dc.publisher | Springer-Verlag | es_ES |
dc.title | Formal methods of tokenization for part-of-speech tagging | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.rights.access | info:eu-repo/semantics/openAccess | es_ES |
Ficheiros no ítem
Este ítem aparece na(s) seguinte(s) colección(s)
-
GI-COLE - Artigos [10]