Integrating external dictionaries into Part-of-speech taggers

UDC.coleccionInvestigaciónes_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
dc.contributor.authorGraña Gil, Jorge
dc.contributor.authorChappelier, J. C.
dc.contributor.authorVilares Ferro, Manuel
dc.date.accessioned2005-11-21T13:23:07Z
dc.date.available2005-11-21T13:23:07Z
dc.date.issued2001
dc.description.abstract[Abstract] The highest performances in part-of-speech tagging have been obtained by using stochastic methods, such as hidden Markov models. The running parameters of a hidden Markov model for tagging can be estimated from tagged corpora. However, the current situation in the automatic processing of some languages is very short training texts, but very large dictionaries. These dictionaries can provide very useful information for improving the treatment of unknown words. In this paper we present new strategies for integrating external dictionaries into a stochastic tagging framework. Instead of the most intuitive Adding One method, we propose the use of the Good-Turing formulas, which produce less distortion of the model we are estimating. This technique guarantees good performances in the automatic processing of languages for which reference texts hardly exist.es_ES
dc.description.sponsorshipEuropean Commisision; 1FD97-0047-C04-02es_ES
dc.description.sponsorshipMinisterio de Educación y Ciencia; TIC2000-0370-C02-01
dc.description.sponsorshipXunta de Galicia; PGIDT99XI10502B.
dc.format.mimetypeapplication/postscript
dc.format.mimetypetext/plain
dc.identifier.citationAngelova, G.; Bontcheva, K.; Mitkov, R.; Nicolov, N.; Nikolov, N. (eds.), Proceedings of the Euroconference on Recent Advances in Natural Language Processing (RANLP-2001), Tzigov Chark (Bulgaria), pp. 122-128.es_ES
dc.identifier.isbn954-90906-1-2
dc.identifier.urihttp://hdl.handle.net/2183/163
dc.language.isoenges_ES
dc.rights.accessRightsopen accesses_ES
dc.titleIntegrating external dictionaries into Part-of-speech taggerses_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublication42896d75-4435-4f99-82e4-48a60a48d799
relation.isAuthorOfPublication3d821e9c-de0b-47cc-a4e0-7c531569602e
relation.isAuthorOfPublication.latestForDiscovery42896d75-4435-4f99-82e4-48a60a48d799

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
GranaRANLP2001.ps
Size:
183.31 KB
Format:
Postscript Files