Tokenization and proper noun recognition for information retrieval
Ver/ abrir
Use este enlace para citar
http://hdl.handle.net/2183/162Coleccións
Metadatos
Mostrar o rexistro completo do ítemTítulo
Tokenization and proper noun recognition for information retrievalAutor(es)
Data
2005-11-21Cita bibliográfica
Proceedings of the Thirteen International Workshop on Database and Expert Systems Applications (DEXA-2002) / Third International Workshop on Natural Language and Information Systems (NLIS-2002), Aix-en-Provence (France),Tjoa, A.M.; Wagner, R.R. (eds.). pp. 246-250.
Resumo
[Abstract] In this paper we consider a set of natural language processing techniques that can be used to analyze large amounts of texts, focusing on the advanced tokenizer which accounts for a number of complex linguistic phenomena, as well as for pre-tagging tasks such as proper noun recognition. We also show the results of several experiments performed in order to study the impact of the strategy chosen for the recognition of proper nouns
ISSN
1529-4188
ISBN
0-7695-1668-8