Listar Lingua e Sociedade da Información (Language in the Information Society) (LYS) por título

Building a New Sentiment Analysis Dataset for Uzbek Language and Creating Baseline Models

Kuriyozov, Elmurod; Matlatipov, Sanatbek (2019-08-02)

[Abstract] Making natural language processing technologies available for low-resource languages is an important goal to improve the access to technology in their communities of speakers. In this paper, we provide the first ...

Clasificación de polaridad en textos con opiniones en español mediante análisis sintáctico de dependencias

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)

[Resumen]: En este artículo se describe un sistema de minería de opiniones que clasifica la polaridad de textos en español. Se propone una aproximación basada en PLN que conlleva realizar una segmentación, tokenización y ...

Cognitive Constraints Built into Formal Grammars: Implications for Language Evolution

Gómez-Rodríguez, Carlos; Christiansen, Morten H.; Ferrer-i-Cancho, Ramon (Ravignani, A., Barbieri, C., Martins, M., Flaherty, M., Jadoul, Y., Lattenkamp, E., Little, H., Mudd, K., Verhoef, T., 2020-04-17)

[Abstract] We study the validity of the cognitive independence assumption using an ensemble of artificial syntactic structures from various classes of dependency grammars. Our findings show that memory limitations have ...

Constituent Parsing as Sequence Labeling

Gómez-Rodríguez, Carlos; Vilares, David (Association for Computational Linguistics (ACL), 2018)

[Absctract]: We introduce a method to reduce constituent parsing to sequence labeling. For each word wt, it generates a label that encodes: (1) the number of ancestors in the tree that the words wt and wt+1 have in common, ...

Construcción de una lista de colocaciones para medir la competencia colocacional

Orol-González, Ana (Centro Virtual Cervantes, 2015)

[Abstrac] The aim of this work is to create a list of Spanish collocations with assessment purpose. For the creation of this list we have followed a set of previously established criteria which are based on lists of frequent ...

Creación de un treebank de dependencias universales mediante recursos existentes para lenguas próximas: el caso del gallego

García, Marcos; Gómez-Rodríguez, Carlos; Alonso, Miguel A. (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016-09)

[Resumen] En este trabajo presentamos una nueva estrategia para crear treebanks de lenguas con pocos recursos para el análisis sintáctico. El método consiste en la adaptación y combinación de diferentes treebanks anotados ...

Dependency parsing with bottom-up Hierarchical Pointer Networks

Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier, 2023-03)

[Abstract] Dependency parsing is a crucial step towards deep language understanding and, therefore, widely demanded by numerous Natural Language Processing applications. In particular, left-to-right and top-down transition-based ...

Detecting Perspectives in Political Debates

Vilares, David; He, Yulan (Association for Computational Linguistics, 2017-09)

[Abstract]: We explore how to detect people’s perspectives that occupy a certain proposition. We propose a Bayesian modelling approach where topics (or propositions) and their associated perspectives (or viewpoints) are ...

Developing Open-Source Roguelike Games for Visually-Impaired Players by Using Low-Complexity NLP Techniques

Fernández-Núñez, Luis; Penas, Darío; Viteri Letamendía, Jorge; Vilares, Jesús (MDPI, 2020-08-19)

[Abstract] The prominent graphic component of video games greatly limits the accessibility of thistype of entertainment by visually impaired users. We make here an overview of the first gamesdeveloped within an initiative ...

Discontinuous grammar as a foreign language

Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier, 2023-03)

[Abstract] In order to achieve deep natural language understanding, syntactic constituent parsing is a vital step, highly demanded by many artificial intelligence systems to process both text and speech. One of the most ...

Discovering Topics in Twitter About the COVID-19 Outbreak in Spain

Agüero-Torales, Marvin M.; Vilares, David; López-Herrera, Antonio G. (Sociedad Española de Procesamiento del Lenguaje Natural, 2021)

[Resumen] En este trabajo, analizamos lo que los usuarios han estado discutiendo en Twitter durante el comienzo de la pandemia causada por el COVID-19. Concretamente, analizamos tres fases diferenciadas de la crisis del ...

EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (European Language Resources Association (ELRA), 2016-05)

[Abstract]: Code-switching texts are those that contain terms in two or more different languages, and they appear increasingly often in social media. The aim of this paper is to provide a resource to the research community ...

Entity linking with distributional semantics

Gamallo, Pablo; García, Marcos (Springer, 2016-07)

[Abstract] Entity Linking (EL) consists in linking name mentions in a given text with their referring entities in external knowledge bases such as DBpedia/Wikipedia. In this paper, we propose an EL approach whose main ...

Faster shift-reduce constituent parsing with a non-binary, bottom-up strategy

Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier B.V., 2019-10)

[Absctract]: An increasingly wide range of artificial intelligence applications rely on syntactic information to process and extract meaning from natural language text or speech, with constituent trees being one of the ...

Global Transition-based Non-projective Dependency Parsing

Fernández-González, Daniel; Shi, Tianze; Lee, Lillian (Association for Computational Linguistics (ACL), 2018)

[Absctract]: Shi, Huang, and Lee (2017a) obtained state-of-the-art results for English and Chinese dependency parsing by combining dynamic-programming implementations of transition-based dependency parsers with a minimal ...

How important is syntactic parsing accuracy? An empirical evaluation on rule-based sentiment analysis

Gómez-Rodríguez, Carlos; Alonso-Alonso, Iago; Vilares, David (Springer, 2019)

[Abstract]: Syntactic parsing, the process of obtaining the internal structure of sentences in natural languages, is a crucial task for artificial intelligence applications that need to extract meaning from natural language ...

Identificación Automática del Idioma en Twitter: Adaptación de Identificadores del Estado del Arte al Contexto Ibérico

Doval, Yerai; Vilares, David; Vilares, Jesús (CEUR-WS.org, 2014)

[Abstract]: We describe here our partipation in TweetLID. After having studied the problem of language identification, the resources available, and designed a text conflation approach for this kind of tasks, we joined ...

Incorporating lexico-semantic heuristics into coreference resolution sieves for named entity recognition at document-level

García, Marcos (European Language Resources Association (ELRA), 2016-05)

[Abstract] This paper explores the incorporation of lexico-semantic heuristics into a deterministic Coreference Resolution (CR) system for classifying named entities at document-level. The highest precise sieves of a CR ...

Increasing NLP Parsing Efficiency with Chunking

Anderson, Mark Dáibhidh; Vilares, David (M D P I AG, 2018-09-19)

[Abstract] We introduce a “Chunk-and-Pass” parsing technique influenced by a psycholinguistic model, where linguistic information is processed not word-by-word but rather in larger chunks of words. We present preliminary ...

Intelligent retrieval for biodiversity

Vilares Ferro, Manuel; Fernández, Milagros; Blanco, Adrián; Gómez-Rodríguez, Carlos (2016-02)

[Abstract] A knowledge discovery and representation frame to mine contents in systems biology is described. It applies natural language processing to integrate linguistic and domain knowledge in a mathematical model for ...