• TIR over Egyptian Hieroglyphs 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (IEEE Computer Society Press, 2016-09)
      [Abstract] This work presents an Information Retrieval system specifically designed to manage Ancient Egyptian hieroglyphic texts taking into account their peculiarities both at lexical and at encoding level for its ...
    • Tokenization and proper noun recognition for information retrieval 

      Barcala Rodríguez, Francisco Mario; Vilares, Jesús; Alonso, Miguel A.; Graña Gil, Jorge; Vilares Ferro, Manuel (IEEE Computer Society Press, 2005-11-21)
      [Abstract] In this paper we consider a set of natural language processing techniques that can be used to analyze large amounts of texts, focusing on the advanced tokenizer which accounts for a number of complex linguistic ...
    • Towards Robust Word Embeddings for Noisy Texts 

      Doval, Yerai; Vilares, Jesús; Gómez-Rodríguez, Carlos (MDPI, 2020)
      [Abstract] Research on word embeddings has mainly focused on improving their performance on standard corpora, disregarding the difficulties posed by noisy texts in the form of tweets and other types of non-standard writing ...