• 20 years of the Grammar Matrix: cross-linguistic hypothesis testing of increasingly complex interactions 

      Zamaraeva, Olga; Curtis, Chris; Emerson, Guy; Fokkens, Antske; Goodman, Michael Wayne; Howell, Kristen; Trimble, T.J.; Bender, Emily M. (Institute of Computer Science, Polish Academy of Sciences, 2022-10-20)
      [Abstract] The Grammar Matrix project is a meta-grammar engineering framework expressed in Head-driven Phrase Structure Grammar (HPSG) and Minimal Recursion Semantics (MRS). It automates grammar implementation and is thus ...
    • A linguistic approach for determining the topics of Spanish Twitter messages 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (SAGE Publications & CILIP, 2015)
      [Abstract]: The vast number of opinions and reviews provided in Twitter is helpful in order to make interesting findings about a given industry, but given the huge number of messages published every day, it is important ...
    • A non-projective greedy dependency parser with bidirectional LSTMs 

      Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2017-08)
      [Abstract]: The LyS-FASTPARSE team present BIST-COVINGTON, a neural implementation of the Covington (2001) algorithm for non-projective dependency parsing. The bidirectional LSTM approach by Kiperwasser and Goldberg (2016) ...
    • A review on political analysis and social media 

      Vilares, David; Alonso, Miguel A. (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016)
      [Abstract] In democratic countries, forecasting the voting intentions of citizens and knowing their opinions on major political parties and leaders is of great interest to the parties themselves, to the media, and to the ...
    • A syntactic approach for opinion mining on Spanish reviews 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Cambridge University Press, 2015-01)
      [Abstract]: We describe an opinion mining system which classifies the polarity of Spanish texts. We propose an NLP approach that undertakes pre-processing, tokenisation and POS tagging of texts to then obtain the syntactic ...
    • Absolute convergence and error thresholds in non-active adaptive sampling 

      Vilares Ferro, Manuel; Darriba Bilbao, Víctor M.; Vilares, Jesús (Elsevier Inc., 2022-05)
      [Abstract] Non-active adaptive sampling is a way of building machine learning models from a training data base which are supposed to dynamically and automatically derive guaranteed sample size. In this context and regardless ...
    • Alternances actantielles et la montée du possesseur: une étude de cas en espagnol 

      Alonso-Ramos, Margarita (2009)
      [Resumen]Este artículo estudia la realización sintáctica de los poseedores de un objeto directo como dependiente sintáctico del verbo, es decir, lo que se conoce como “el ascenso del poseedor”: besar a María en la frente ...
    • Any papyrus about "a hand over a stool and a bread loaf, followed by a boat"? Dealing with hieroglyphic texts in IR 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (ACM International Conference Proceeding Series, 2016-06)
      [Abstract] Digital Heritage deals with the use of computing and information technologies for the preservation and study of the human cultural legacy. Within this context, we present here a Text Retrieval system developed ...
    • Una aproximación supervisada para la minería de opiniones sobre tuits en español en base a conocimiento lingüístico 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)
      [Resumen]: En este artículo se describe un sistema para la clasificación de la polaridad de tuits escritos en español. Se adopta una aproximación híbrida, que combina conocimiento lingüístico obtenido mediante PLN con ...
    • Building a New Sentiment Analysis Dataset for Uzbek Language and Creating Baseline Models 

      Kuriyozov, Elmurod; Matlatipov, Sanatbek (2019-08-02)
      [Abstract] Making natural language processing technologies available for low-resource languages is an important goal to improve the access to technology in their communities of speakers. In this paper, we provide the first ...
    • Clasificación de polaridad en textos con opiniones en español mediante análisis sintáctico de dependencias 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)
      [Resumen]: En este artículo se describe un sistema de minería de opiniones que clasifica la polaridad de textos en español. Se propone una aproximación basada en PLN que conlleva realizar una segmentación, tokenización y ...
    • Cognitive Constraints Built into Formal Grammars: Implications for Language Evolution 

      Gómez-Rodríguez, Carlos; Christiansen, Morten H.; Ferrer-i-Cancho, Ramon (Ravignani, A., Barbieri, C., Martins, M., Flaherty, M., Jadoul, Y., Lattenkamp, E., Little, H., Mudd, K., Verhoef, T., 2020-04-17)
      [Abstract] We study the validity of the cognitive independence assumption using an ensemble of artificial syntactic structures from various classes of dependency grammars. Our findings show that memory limitations have ...
    • Constituent Parsing as Sequence Labeling 

      Gómez-Rodríguez, Carlos; Vilares, David (Association for Computational Linguistics (ACL), 2018)
      [Absctract]: We introduce a method to reduce constituent parsing to sequence labeling. For each word wt, it generates a label that encodes: (1) the number of ancestors in the tree that the words wt and wt+1 have in common, ...
    • Construcción de una lista de colocaciones para medir la competencia colocacional 

      Orol-González, Ana (Centro Virtual Cervantes, 2015)
      [Abstrac] The aim of this work is to create a list of Spanish collocations with assessment purpose. For the creation of this list we have followed a set of previously established criteria which are based on lists of frequent ...
    • Creación de un treebank de dependencias universales mediante recursos existentes para lenguas próximas: el caso del gallego 

      García, Marcos; Gómez-Rodríguez, Carlos; Alonso, Miguel A. (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016-09)
      [Resumen] En este trabajo presentamos una nueva estrategia para crear treebanks de lenguas con pocos recursos para el análisis sintáctico. El método consiste en la adaptación y combinación de diferentes treebanks anotados ...
    • Dependency parsing with bottom-up Hierarchical Pointer Networks 

      Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier, 2023-03)
      [Abstract] Dependency parsing is a crucial step towards deep language understanding and, therefore, widely demanded by numerous Natural Language Processing applications. In particular, left-to-right and top-down transition-based ...
    • Detecting Perspectives in Political Debates 

      Vilares, David; He, Yulan (Association for Computational Linguistics, 2017-09)
      [Abstract]: We explore how to detect people’s perspectives that occupy a certain proposition. We propose a Bayesian modelling approach where topics (or propositions) and their associated perspectives (or viewpoints) are ...
    • Developing Open-Source Roguelike Games for Visually-Impaired Players by Using Low-Complexity NLP Techniques 

      Fernández-Núñez, Luis; Penas, Darío; Viteri Letamendía, Jorge; Vilares, Jesús (MDPI, 2020-08-19)
      [Abstract] The prominent graphic component of video games greatly limits the accessibility of thistype of entertainment by visually impaired users. We make here an overview of the first gamesdeveloped within an initiative ...
    • Discontinuous grammar as a foreign language 

      Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier, 2023-03)
      [Abstract] In order to achieve deep natural language understanding, syntactic constituent parsing is a vital step, highly demanded by many artificial intelligence systems to process both text and speech. One of the most ...
    • Discovering Topics in Twitter About the COVID-19 Outbreak in Spain 

      Agüero-Torales, Marvin M.; Vilares, David; López-Herrera, Antonio G. (Sociedad Española de Procesamiento del Lenguaje Natural, 2021)
      [Resumen] En este trabajo, analizamos lo que los usuarios han estado discutiendo en Twitter durante el comienzo de la pandemia causada por el COVID-19. Concretamente, analizamos tres fases diferenciadas de la crisis del ...