• Absolute convergence and error thresholds in non-active adaptive sampling 

      Vilares Ferro, Manuel; Darriba Bilbao, Víctor M.; Vilares, Jesús (Elsevier Inc., 2022-05)
      [Abstract] Non-active adaptive sampling is a way of building machine learning models from a training data base which are supposed to dynamically and automatically derive guaranteed sample size. In this context and regardless ...
    • Any papyrus about "a hand over a stool and a bread loaf, followed by a boat"? Dealing with hieroglyphic texts in IR 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (ACM International Conference Proceeding Series, 2016-06)
      [Abstract] Digital Heritage deals with the use of computing and information technologies for the preservation and study of the human cultural legacy. Within this context, we present here a Text Retrieval system developed ...
    • Developing Open-Source Roguelike Games for Visually-Impaired Players by Using Low-Complexity NLP Techniques 

      Fernández-Núñez, Luis; Penas, Darío; Viteri Letamendía, Jorge; Vilares, Jesús (MDPI, 2020-08-19)
      [Abstract] The prominent graphic component of video games greatly limits the accessibility of this type of entertainment by visually impaired users. We make here an overview of the first games developed within an initiative ...
    • Identificación Automática del Idioma en Twitter: Adaptación de Identificadores del Estado del Arte al Contexto Ibérico 

      Doval, Yerai; Vilares, David; Vilares, Jesús (CEUR-WS.org, 2014)
      [Abstract]: We describe here our participation in TweetLID. After having studied the problem of language identification, the resources available, and designed a text conflation approach for this kind of tasks, we joined ...
    • On the Feasibility of Character n-Grams Pseudo-Translation for Cross-Language Information Retrieval Tasks 

      Vilares, Jesús; Vilares Ferro, Manuel; Alonso, Miguel A.; Oakes, Michael P. (2016-03)
      [Abstract] The field of Cross-Language Information Retrieval relates techniques close to both the Machine Translation and Information Retrieval fields, although in a context involving characteristics of its own. The present ...
    • On the performance of phonetic algorithms in microtext normalization 

      Doval, Yerai; Vilares Ferro, Manuel; Vilares, Jesús (Elsevier, 2018-12-15)
      [Abstract]: User–generated content published on microblogging social networks constitutes a priceless source of information. However, microtexts usually deviate from the standard lexical and grammatical rules of the language, ...
    • On the Use of Parsing for Named Entity Recognition 

      Alonso, Miguel A.; Gómez-Rodríguez, Carlos; Vilares, Jesús (MDPI, 2021-01-25)
      [Abstract] Parsing is a core natural language processing technique that can be used to obtain the structure underlying sentences in human languages. Named entity recognition (NER) is the task of identifying the entities ...
    • Prototipado rápido de un sistema de normalización de tuits: una aproximación léxica 

      Vilares, Jesús; Alonso, Miguel A.; Vilares, David (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)
      [Abstract]: This work describes the Spanish tweet normalization system developed by the Grupo de Lengua Y Sociedad de la Información (LYS) of the Universidade da Coruña for Tweet-Norm 2013. It is a ...
    • Searching Four-Millennia-Old Documents: A Text Retrieval System for Egyptologists 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (2016-08)
      [Abstract] Progress made in recent years has led to a growing interest in Digital Heritage. This article focuses on Egyptology and, more specifically, the study and preservation of ancient Egyptian scripts. We present a ...
    • Segmentación de palabras en español mediante modelos del lenguaje basados en redes neuronales 

      Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016-09)
      [Abstract] Microblogging platforms abound in certain special tokens, such as hashtags or mentions, in which a group of words is written together without spacing between them; e.g.: #añobisiesto or ...
    • Seguimiento y análisis automático de contenidos en redes sociales 

      Alonso, Miguel A.; Gómez-Rodríguez, Carlos; Vilares, David; Doval, Yerai; Vilares, Jesús (Centro Universitario de la Defensa de Marín, 2015)
      [Abstract]: Opinion Mining is the discipline that addresses the automatic processing of the opinions contained in a text. It makes it possible, for example, to determine whether or not an opinion is being expressed in a text, or whether the polarity ...
    • Sentiment Analysis for Fake News Detection 

      Alonso, Miguel A.; Vilares, David; Gómez-Rodríguez, Carlos; Vilares, Jesús (MDPI, 2021)
      [Abstract] In recent years, we have witnessed a rise in fake news, i.e., provably false pieces of information created with the intention of deception. The dissemination of this type of news poses a serious threat to cohesion ...
    • Shallow Recurrent Neural Network for Personality Recognition in Source Code 

      Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (CEUR Workshop Proceedings, 2016-12)
      [Abstract] Personality recognition in source code constitutes a novel task in the field of author profiling on written text. In this paper we describe our proposal for the PR-SOCO shared task in FIRE 2016, which is based ...
    • Studying the Effect and Treatment of Misspelled Queries in Cross-Language Information Retrieval 

      Vilares, Jesús; Alonso, Miguel A.; Doval, Yerai; Vilares Ferro, Manuel (2016-07)
      [Abstract] The performance of Information Retrieval systems is limited by the linguistic variation present in natural language texts. Word-level Natural Language Processing techniques have been shown to be useful in reducing ...
    • TIR over Egyptian Hieroglyphs 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (IEEE Computer Society Press, 2016-09)
      [Abstract] This work presents an Information Retrieval system specifically designed to manage Ancient Egyptian hieroglyphic texts taking into account their peculiarities both at lexical and at encoding level for its ...
    • Towards Robust Word Embeddings for Noisy Texts 

      Doval, Yerai; Vilares, Jesús; Gómez-Rodríguez, Carlos (MDPI, 2020)
      [Abstract] Research on word embeddings has mainly focused on improving their performance on standard corpora, disregarding the difficulties posed by noisy texts in the form of tweets and other types of non-standard writing ...