Browse Lingua e Sociedade da Información (Language in the Information Society) (LYS) by subject "Natural language processing"

A linguistic approach for determining the topics of Spanish Twitter messages

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (SAGE Publications & CILIP, 2015)

[Abstract]: The vast number of opinions and reviews provided in Twitter is helpful in order to make interesting findings about a given industry, but given the huge number of messages published every day, it is important ...

A non-projective greedy dependency parser with bidirectional LSTMs

Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2017-08)

[Abstract]: The LyS-FASTPARSE team present BIST-COVINGTON, a neural implementation of the Covington (2001) algorithm for non-projective dependency parsing. The bidirectional LSTM approach by Kiperwasser and Goldberg (2016) ...

A syntactic approach for opinion mining on Spanish reviews

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Cambridge University Press, 2015-01)

[Abstract]: We describe an opinion mining system which classifies the polarity of Spanish texts. We propose an NLP approach that undertakes pre-processing, tokenisation and POS tagging of texts to then obtain the syntactic ...

Building a New Sentiment Analysis Dataset for Uzbek Language and Creating Baseline Models

Kuriyozov, Elmurod; Matlatipov, Sanatbek (2019-08-02)

[Abstract] Making natural language processing technologies available for low-resource languages is an important goal to improve the access to technology in their communities of speakers. In this paper, we provide the first ...

Dependency parsing with bottom-up Hierarchical Pointer Networks

Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier, 2023-03)

[Abstract] Dependency parsing is a crucial step towards deep language understanding and, therefore, widely demanded by numerous Natural Language Processing applications. In particular, left-to-right and top-down transition-based ...

Detecting Perspectives in Political Debates

Vilares, David; He, Yulan (Association for Computational Linguistics, 2017-09)

[Abstract]: We explore how to detect people’s perspectives that occupy a certain proposition. We propose a Bayesian modelling approach where topics (or propositions) and their associated perspectives (or viewpoints) are ...

Discontinuous grammar as a foreign language

Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier, 2023-03)

[Abstract] In order to achieve deep natural language understanding, syntactic constituent parsing is a vital step, highly demanded by many artificial intelligence systems to process both text and speech. One of the most ...

Discovering Topics in Twitter About the COVID-19 Outbreak in Spain

Agüero-Torales, Marvin M.; Vilares, David; López-Herrera, Antonio G. (Sociedad Española de Procesamiento del Lenguaje Natural, 2021)

[Abstract] In this work, we analyse what users have been discussing on Twitter during the beginning of the pandemic caused by COVID-19. Specifically, we analyse three distinct phases of the crisis of the ...

Faster shift-reduce constituent parsing with a non-binary, bottom-up strategy

Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier B.V., 2019-10)

[Abstract]: An increasingly wide range of artificial intelligence applications rely on syntactic information to process and extract meaning from natural language text or speech, with constituent trees being one of the ...

How important is syntactic parsing accuracy? An empirical evaluation on rule-based sentiment analysis

Gómez-Rodríguez, Carlos; Alonso-Alonso, Iago; Vilares, David (Springer, 2019)

[Abstract]: Syntactic parsing, the process of obtaining the internal structure of sentences in natural languages, is a crucial task for artificial intelligence applications that need to extract meaning from natural language ...

Increasing NLP Parsing Efficiency with Chunking

Anderson, Mark Dáibhidh; Vilares, David (MDPI AG, 2018-09-19)

[Abstract] We introduce a “Chunk-and-Pass” parsing technique influenced by a psycholinguistic model, where linguistic information is processed not word-by-word but rather in larger chunks of words. We present preliminary ...

Intelligent retrieval for biodiversity

Vilares Ferro, Manuel; Fernández, Milagros; Blanco, Adrián; Gómez-Rodríguez, Carlos (2016-02)

[Abstract] A knowledge discovery and representation frame to mine contents in systems biology is described. It applies natural language processing to integrate linguistic and domain knowledge in a mathematical model for ...

Multidimensional Affective Analysis for Low-Resource Languages: A Use Case with Guarani-Spanish Code-Switching Language

Agüero-Torales, Marvin M.; López-Herrera, Antonio G.; Vilares, David (Springer, 2023)

[Abstract]: This paper focuses on text-based affective computing for Jopara, a code-switching language that combines Guarani and Spanish. First, we collected a dataset of tweets primarily written in Guarani and annotated ...

Multitask Pointer Network for Multi-Representational Parsing

Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier, 2022-01-25)

[Abstract] Dependency and constituent trees are widely used by many artificial intelligence applications for representing the syntactic structure of human languages. Typically, these structures are separately produced by ...

On the Use of Parsing for Named Entity Recognition

Alonso, Miguel A.; Gómez-Rodríguez, Carlos; Vilares, Jesús (MDPI, 2021-01-25)

[Abstract] Parsing is a core natural language processing technique that can be used to obtain the structure underlying sentences in human languages. Named entity recognition (NER) is the task of identifying the entities ...

On the usefulness of lexical and syntactic processing in polarity classification of Twitter messages

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Wiley, 2015-09)

[Abstract]: Millions of micro texts are published every day on Twitter. Identifying the sentiment present in them can be helpful for measuring the frame of mind of the public, their satisfaction with respect to a product, ...

Optimality of syntactic dependency distances

Ferrer-i-Cancho, Ramon; Gómez-Rodríguez, Carlos; Esteban, Juan Luis; Alemany-Puig, Lluís (American Physical Society, 2022-01)

[Abstract]: It is often stated that human languages, as other biological systems, are shaped by cost-cutting pressures but, to what extent? Attempts to quantify the degree of optimality of languages by means of an optimality ...

Parsing as Pretraining

Vilares, David; Strzyz, Michalina; Søgaard, Anders; Gómez-Rodríguez, Carlos (2020)

[Abstract] Recent analyses suggest that encoders pretrained for language modeling capture certain morpho-syntactic structure. However, probing frameworks for word vectors still do not report results on standard setups ...

Semantic Relation Extraction. Resources, Tools and Strategies

García, Marcos (Springer, 2016-07)

[Abstract] Relation extraction is a subtask of information extraction that aims at obtaining instances of semantic relations present in texts. This information can be arranged in machine-readable formats, useful for several ...

Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2015)

[Abstract]: We address the problem of performing polarity classification on Twitter over different languages, focusing on English and Spanish, comparing three techniques: (1) a monolingual model which knows ...