Buscar
Mostrando ítems 1-7 de 7
Stochastic parsing and parallelism
(Springer-Verlag, 2001)
[Abstract] Parsing CYK-like algorithms are inherently parallel: there are a lot of cells in the chart that can be calculated simultaneously. In this work, we present a study on the appropriate techniques of paralle-lism ...
Formal methods of tokenization for part-of-speech tagging
(Springer-Verlag, 2002)
[Abstract] One of the most important prior tasks for robust part-of-speech tagging is the correct tokenization or segmentation of the texts. This task can involve processes which are much more complex than the simple ...
Tokenization and proper noun recognition for information retrieval
(IEEE Computer Society Press, 2005-11-21)
[Abstract] In this paper we consider a set of natural language processing techniques that can be used to analyze large amounts of texts, focusing on the advanced tokenizer which accounts for a number of complex linguistic ...
Practical NLP-Based Text Indexing
(Springer Verlag, 2002)
Compilation methods of minimal acyclic finite-state automata for large dictionaries.
(Springer-Verlag, 2001)
[Abstract] We present a reflection on the evolution of the different methods for constructing minimal deterministic acyclic finite-state automata from a finite set of words. We outline the most important methods, including ...