Listar Lingua e Sociedade da Información (Language in the Information Society) (LYS) por título

Discovering Topics in Twitter About the COVID-19 Outbreak in Spain

Agüero-Torales, Marvin M.; Vilares, David; López-Herrera, Antonio G. (Sociedad Española de Procesamiento del Lenguaje Natural, 2021)

[Resumen] En este trabajo, analizamos lo que los usuarios han estado discutiendo en Twitter durante el comienzo de la pandemia causada por el COVID-19. Concretamente, analizamos tres fases diferenciadas de la crisis del ...

EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (European Language Resources Association (ELRA), 2016-05)

[Abstract]: Code-switching texts are those that contain terms in two or more different languages, and they appear increasingly often in social media. The aim of this paper is to provide a resource to the research community ...

Entity linking with distributional semantics

Gamallo, Pablo; García, Marcos (Springer, 2016-07)

[Abstract] Entity Linking (EL) consists in linking name mentions in a given text with their referring entities in external knowledge bases such as DBpedia/Wikipedia. In this paper, we propose an EL approach whose main ...

Faster shift-reduce constituent parsing with a non-binary, bottom-up strategy

Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier B.V., 2019-10)

[Absctract]: An increasingly wide range of artificial intelligence applications rely on syntactic information to process and extract meaning from natural language text or speech, with constituent trees being one of the ...

From Partial to Strictly Incremental Constituent Parsing

Ezquerro, Ana; Gómez-Rodríguez, Carlos; Vilares, David (Association for Computational Linguistics, 2024-03)

[Absctract]: We study incremental constituent parsers to assess their capacity to output trees based on prefix representations alone. Guided by strictly left-to-right generative language models and tree-decoding modules, ...

Global Transition-based Non-projective Dependency Parsing

Fernández-González, Daniel; Shi, Tianze; Lee, Lillian (Association for Computational Linguistics (ACL), 2018)

[Absctract]: Shi, Huang, and Lee (2017a) obtained state-of-the-art results for English and Chinese dependency parsing by combining dynamic-programming implementations of transition-based dependency parsers with a minimal ...

Grounding the Semantics of Part-of-Day Nouns Worldwide using Twitter

Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2018-06)

[Absctract]: The usage of part-of-day nouns, such as ‘night’, and their time-specific greetings (‘good night’), varies across languages and cultures. We show the possibilities that Twitter offers for studying the semantics ...

Harry Potter and the Action Prediction Challenge from Natural Language

Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-06)

[Absctract]: We explore the challenge of action prediction from textual descriptions of scenes, a testbed to approximate whether text inference can be used to predict upcoming actions. As a case of study, we consider the ...

HEAD-QA: A Healthcare Dataset for Complex Reasoning

Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-07)

[Absctract]: We present HEAD-QA, a multi-choice question answering testbed to encourage research on complex reasoning. The questions come from exams to access a specialized position in the Spanish healthcare system, and ...

How important is syntactic parsing accuracy? An empirical evaluation on rule-based sentiment analysis

Gómez-Rodríguez, Carlos; Alonso-Alonso, Iago; Vilares, David (Springer, 2019)

[Abstract]: Syntactic parsing, the process of obtaining the internal structure of sentences in natural languages, is a crucial task for artificial intelligence applications that need to extract meaning from natural language ...

Identificación Automática del Idioma en Twitter: Adaptación de Identificadores del Estado del Arte al Contexto Ibérico

Doval, Yerai; Vilares, David; Vilares, Jesús (CEUR-WS.org, 2014)

[Abstract]: We describe here our partipation in TweetLID. After having studied the problem of language identification, the resources available, and designed a text conflation approach for this kind of tasks, we joined ...

Incorporating lexico-semantic heuristics into coreference resolution sieves for named entity recognition at document-level

García, Marcos (European Language Resources Association (ELRA), 2016-05)

[Abstract] This paper explores the incorporation of lexico-semantic heuristics into a deterministic Coreference Resolution (CR) system for classifying named entities at document-level. The highest precise sieves of a CR ...

Increasing NLP Parsing Efficiency with Chunking

Anderson, Mark Dáibhidh; Vilares, David (M D P I AG, 2018-09-19)

[Abstract] We introduce a “Chunk-and-Pass” parsing technique influenced by a psycholinguistic model, where linguistic information is processed not word-by-word but rather in larger chunks of words. We present preliminary ...

Intelligent retrieval for biodiversity

Vilares Ferro, Manuel; Fernández, Milagros; Blanco, Adrián; Gómez-Rodríguez, Carlos (2016-02)

[Abstract] A knowledge discovery and representation frame to mine contents in systems biology is described. It applies natural language processing to integrate linguistic and domain knowledge in a mathematical model for ...

Left-to-Right Dependency Parsing with Pointer Networks

Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Association for Computational Linguistics (ACL), 2019)

[Abstract]: We propose a novel transition-based algorithm that straightforwardly parses sentences from left to right by building n attachments, with n being the length of the input sentence. Similarly to the recent ...

Liberating language research from dogmas of the 20th century

Ferrer-i-Cancho, Ramon; Gómez-Rodríguez, Carlos (2016)

[Abstract] A commentary on the article “Large -scale evidence of dependency length minimization in 37 languages” by Futrell, Mahowald & Gibson (PNAS 2015 112 (33) 10336-10341).

Lyapunov filtering of objectivity for Spanish sentiment model

Chaturvedi, Iti; Cambria, Erik; Vilares, David (IEEE, 2016-07)

[Abstract] Objective sentences lack sentiments and, hence, can reduce the accuracy of a sentiment classifier. Traditional methods prior to 2001 used hand-crafted templates to identify subjectivity and did not generalize ...

LyS A Coruña at GUA-SPA@IberLEF2023. Multi-Task Learning with Large Language Model Encoders for Guarani-Spanish Code Switching Analysis

Muñoz Ortiz, Alberto; Vilares, David (2023)

[Abstract] This paper introduces the LyS A Coruña proposal for the Guarani-Spanish Code Switching Analysis task at IberLEF2023. The shared task proposes to analyze Guarani-Spanish code-switched texts, focusing on language ...

LyS at SemEval-2016 Task 4: Exploiting Neural Activation Values for Twitter Sentiment Classification and Quantification

Vilares, David; Doval, Yerai; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2016)

[Abstract]: In this paper we describe our deep learning approach for solving both two-, three- and fiveclass tweet polarity classification, and twoand five-class quantification. We first trained a convolutional neural ...

LyS at TASS 2013: Analysing Spanish tweets by means of dependency parsing, semantic-oriented lexicons and psychometric word-properties

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)

[Abstract]: This article describes the approach developed by our group in order to resolve the sentiment analysis at a global level, topic identification and political tendency classification tasks on Spanish tweets; ...