Listar Lingua e Sociedade da Información (Language in the Information Society) (LYS) por título

EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (European Language Resources Association (ELRA), 2016-05)

[Abstract]: Code-switching texts are those that contain terms in two or more different languages, and they appear increasingly often in social media. The aim of this paper is to provide a resource to the research community ...

Entity linking with distributional semantics

Gamallo, Pablo; García, Marcos (Springer, 2016-07)

[Abstract] Entity Linking (EL) consists in linking name mentions in a given text with their referring entities in external knowledge bases such as DBpedia/Wikipedia. In this paper, we propose an EL approach whose main ...

Exploring cross-lingual word embeddings for the inference of bilingual dictionaries

García, Marcos; García Salido, Marcos; Alonso, Miguel A. (CEUR Workshop Proceedings, 2019)

[Abstract]: We describe four systems to generate automatically bilingual dictionaries based on existing ones: three transitive systems differing only in the pivot language used, and a system based on a different ...

Faster shift-reduce constituent parsing with a non-binary, bottom-up strategy

Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier B.V., 2019-10)

[Absctract]: An increasingly wide range of artificial intelligence applications rely on syntactic information to process and extract meaning from natural language text or speech, with constituent trees being one of the ...

From Partial to Strictly Incremental Constituent Parsing

Ezquerro, Ana; Gómez-Rodríguez, Carlos; Vilares, David (Association for Computational Linguistics, 2024-03)

[Absctract]: We study incremental constituent parsers to assess their capacity to output trees based on prefix representations alone. Guided by strictly left-to-right generative language models and tree-decoding modules, ...

From Tokens to Trees: Mapping Syntactic Structures in the Deserts of Data-Scarce Languages

Vilares, David; Muñoz Ortiz, Alberto (CEUR-WS, 2024-06)

[Abstract]: Low-resource learning in natural language processing focuses on developing effective resources, tools, and technologies for languages that are less popular within the industry and academia. This effort is crucial ...

Global Transition-based Non-projective Dependency Parsing

Fernández-González, Daniel; Shi, Tianze; Lee, Lillian (Association for Computational Linguistics (ACL), 2018)

[Absctract]: Shi, Huang, and Lee (2017a) obtained state-of-the-art results for English and Chinese dependency parsing by combining dynamic-programming implementations of transition-based dependency parsers with a minimal ...

GRALENIA: Antimicrobial Resistance Management based on Natural Language and Artificial Intelligence

Bernardo-Castiñeira, Cristóbal; Bou, Germán; Campos, Manuel; Cánovas-Segura, Bernardo; Figueiras Gómez, Sergio; Gómez-Rodríguez, Carlos; Míguez-Rey, Enrique; Vilares, Jesús (CEUR-WS, 2024)

[Abstract]: The objective of GRALENIA project is to develop a multidisciplinary, comprehensive and interoperable platform incorporating artificial intelligence algorithms and natural language processing techniques to improve ...

Grammar Assistance Using Syntactic Structures (GAUSS)

Zamaraeva, Olga; Suárez Allegue, Lorena; Gómez-Rodríguez, Carlos; Alonso-Ramos, Margarita; Ogneva, Anastasiia (CEUR-WS, 2024)

[Abstract]: Automatic grammar coaching serves an important purpose of advising on standard grammar varieties while not imposing social pressures or reinforcing established social roles. Such systems already exist but most ...

Grounding the Semantics of Part-of-Day Nouns Worldwide using Twitter

Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2018-06)

[Absctract]: The usage of part-of-day nouns, such as ‘night’, and their time-specific greetings (‘good night’), varies across languages and cultures. We show the possibilities that Twitter offers for studying the semantics ...

Harry Potter and the Action Prediction Challenge from Natural Language

Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-06)

[Absctract]: We explore the challenge of action prediction from textual descriptions of scenes, a testbed to approximate whether text inference can be used to predict upcoming actions. As a case of study, we consider the ...

HARTAes-vas: Lexical combinations for an academic writing aid tool in Spanish and Basque

Alonso-Ramos, Margarita; Zabala, Igone (CEUR-WS.org, 2022)

[Abstract] Academic writing has become a priority object of study especially in English, for which there are already many resources to help novice writers. This is not the case for Spanish university students who do not ...

HEAD-QA: A Healthcare Dataset for Complex Reasoning

Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-07)

[Absctract]: We present HEAD-QA, a multi-choice question answering testbed to encourage research on complex reasoning. The questions come from exams to access a specialized position in the Spanish healthcare system, and ...

Una herramienta para la ayuda a la redacción de textos académicos (HARTA) como uso de las TIC en el proceso de escritura

Guzzi, Eleonora; Alonso-Ramos, Margarita (REDINE (Red de Investigación e Innovación Educativa), 2020)

[Resumen] Los estudiantes universitarios españoles tienen que hacer frente a un proceso de escritura en el que no solo tienen que plasmar su conocimiento disciplinar, sino que tienen que hacerlo en un tipo de registro ...

How important is syntactic parsing accuracy? An empirical evaluation on rule-based sentiment analysis

Gómez-Rodríguez, Carlos; Alonso-Alonso, Iago; Vilares, David (Springer, 2019)

[Abstract]: Syntactic parsing, the process of obtaining the internal structure of sentences in natural languages, is a crucial task for artificial intelligence applications that need to extract meaning from natural language ...

Identificación Automática del Idioma en Twitter: Adaptación de Identificadores del Estado del Arte al Contexto Ibérico

Doval, Yerai; Vilares, David; Vilares, Jesús (CEUR-WS.org, 2014)

[Abstract]: We describe here our partipation in TweetLID. After having studied the problem of language identification, the resources available, and designed a text conflation approach for this kind of tasks, we joined ...

Incorporating lexico-semantic heuristics into coreference resolution sieves for named entity recognition at document-level

García, Marcos (European Language Resources Association (ELRA), 2016-05)

[Abstract] This paper explores the incorporation of lexico-semantic heuristics into a deterministic Coreference Resolution (CR) system for classifying named entities at document-level. The highest precise sieves of a CR ...