• A review on political analysis and social media 

      Vilares, David; Alonso, Miguel A. (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016)
      [Abstract] In democratic countries, forecasting the voting intentions of citizens and knowing their opinions on major political parties and leaders is of great interest to the parties themselves, to the media, and to the ...
    • Intelligent retrieval for biodiversity 

      Vilares Ferro, Manuel; Fernández, Milagros; Blanco, Adrián; Gómez-Rodríguez, Carlos (2016-02)
      [Abstract] A knowledge discovery and representation frame to mine contents in systems biology is described. It applies natural language processing to integrate linguistic and domain knowledge in a mathematical model for ...
    • On the Feasibility of Character n-Grams Pseudo-Translation for Cross-Language Information Retrieval Tasks 

      Vilares, Jesús; Vilares Ferro, Manuel; Alonso, Miguel A.; Oakes, Michael P. (2016-03)
      [Abstract] The field of Cross-Language Information Retrieval relates techniques close to both the Machine Translation and Information Retrieval fields, although in a context involving characteristics of its own. The present ...
    • Incorporating lexico-semantic heuristics into coreference resolution sieves for named entity recognition at document-level 

      García, Marcos (European Language Resources Association (ELRA), 2016-05)
      [Abstract] This paper explores the incorporation of lexico-semantic heuristics into a deterministic Coreference Resolution (CR) system for classifying named entities at document-level. The highest precise sieves of a CR ...
    • EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (European Language Resources Association (ELRA), 2016-05)
      [Abstract]: Code-switching texts are those that contain terms in two or more different languages, and they appear increasingly often in social media. The aim of this paper is to provide a resource to the research community ...
    • Any papyrus about "a hand over a stool and a bread loaf, followed by a boat"? Dealing with hieroglyphic texts in IR 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (ACM International Conference Proceeding Series, 2016-06)
      [Abstract] Digital Heritage deals with the use of computing and information technologies for the preservation and study of the human cultural legacy. Within this context, we present here a Text Retrieval system developed ...
    • The scaling of the minimum sum of edge lengths in uniformly random trees 

      Esteban, Juan Luis; Ferrer-i-Cancho, Ramon; Gómez-Rodríguez, Carlos (2016-06)
      [Abstract] The minimum linear arrangement problem on a network consists of finding the minimum sum of edge lengths that can be achieved when the vertices are arranged linearly. Although there are algorithms to solve this ...
    • Semantic Relation Extraction. Resources, Tools and Strategies 

      García, Marcos (Springer, 2016-07)
      [Abstract] Relation extraction is a subtask of information extraction that aims at obtaining instances of semantic relations present in texts. This information can be arranged in machine-readable formats, useful for several ...
    • Studying the Effect and Treatment of Misspelled Queries in Cross-Language Information Retrieval 

      Vilares, Jesús; Alonso, Miguel A.; Doval, Yerai; Vilares Ferro, Manuel (2016-07)
      [Abstract] The performance of Information Retrieval systems is limited by the linguistic variation present in natural language texts. Word-level Natural Language Processing techniques have been shown to be useful in reducing ...
    • Lyapunov filtering of objectivity for Spanish sentiment model 

      Chaturvedi, Iti; Cambria, Erik; Vilares, David (IEEE, 2016-07)
      [Abstract] Objective sentences lack sentiments and, hence, can reduce the accuracy of a sentiment classifier. Traditional methods prior to 2001 used hand-crafted templates to identify subjectivity and did not generalize ...
    • Entity linking with distributional semantics 

      Gamallo, Pablo; García, Marcos (Springer, 2016-07)
      [Abstract] Entity Linking (EL) consists in linking name mentions in a given text with their referring entities in external knowledge bases such as DBpedia/Wikipedia. In this paper, we propose an EL approach whose main ...
    • Searching Four-Millennia-Old Documents: A Text Retrieval System for Egyptologists 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (2016-08)
      [Abstract] Progress made in recent years has led to a growing interest in Digital Heritage. This article focuses on Egyptology and, more specifically, the study and preservation of ancient Egyptian scripts. We present a ...
    • TIR over Egyptian Hieroglyphs 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (IEEE Computer Society Press, 2016-09)
      [Abstract] This work presents an Information Retrieval system specifically designed to manage Ancient Egyptian hieroglyphic texts taking into account their peculiarities both at lexical and at encoding level for its ...
    • Segmentación de palabras en español mediante modelos del lenguaje basados en redes neuronales 

      Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016-09)
      [Resumen] En las plataformas de microblogging abundan ciertos tokens especiales como los hashtags o las menciones en los que un grupo de palabras se escriben juntas sin espaciado entre ellas; p.ej.: #añobisiesto o ...
    • Creación de un treebank de dependencias universales mediante recursos existentes para lenguas próximas: el caso del gallego 

      García, Marcos; Gómez-Rodríguez, Carlos; Alonso, Miguel A. (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016-09)
      [Resumen] En este trabajo presentamos una nueva estrategia para crear treebanks de lenguas con pocos recursos para el análisis sintáctico. El método consiste en la adaptación y combinación de diferentes treebanks anotados ...
    • Restricted Non-Projectivity: Coverage vs. Efficiency 

      Gómez-Rodríguez, Carlos (2016-12)
      [Abstract] In the last decade, various restricted classes of non-projective dependency trees have been proposed with the goal of achieving a good tradeoff between parsing efficiency and coverage of the syntactic structures ...
    • Shallow Recurrent Neural Network for Personality Recognition in Source Code 

      Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (CEUR Workshop Proceedings, 2016-12)
      [Abstract] Personality recognition in source code constitutes a novel task in the field of author profiling on written text. In this paper we describe our proposal for the PR-SOCO shared task in FIRE 2016, which is based ...
    • Universal, unsupervised (rule-based), uncovered sentiment analysis 

      Vilares, David; Gómez-Rodríguez, Carlos; Alonso, Miguel A. (Elsevier, 2017-02)
      [Abstract]: We present a novel unsupervised approach for multilingual sentiment analysis driven by compositional syntax-based rules. On the one hand, we exploit some of the main advantages of unsupervised algorithms: (1) ...
    • Supervised sentiment analysis in multilingual environments 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Elsevier, 2017-05)
      [Abstract]: This article tackles the problem of performing multilingual polarity classification on Twitter, comparing three techniques: (1) a multilingual model trained on a multilingual dataset, obtained by fusing existing ...
    • A non-projective greedy dependency parser with bidirectional LSTMs 

      Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2017-08)
      [Abstract]: The LyS-FASTPARSE team present BIST-COVINGTON, a neural implementation of the Covington (2001) algorithm for non-projective dependency parsing. The bidirectional LSTM approach by Kiperwasser and Goldberg (2016) ...