Browse GI-LYS - Articles by title

20 years of the Grammar Matrix: cross-linguistic hypothesis testing of increasingly complex interactions

Zamaraeva, Olga; Curtis, Chris; Emerson, Guy; Fokkens, Antske; Goodman, Michael Wayne; Howell, Kristen; Trimble, T.J.; Bender, Emily M. (Institute of Computer Science, Polish Academy of Sciences, 2022-10-20)

[Abstract] The Grammar Matrix project is a meta-grammar engineering framework expressed in Head-driven Phrase Structure Grammar (HPSG) and Minimal Recursion Semantics (MRS). It automates grammar implementation and is thus ...

A linguistic approach for determining the topics of Spanish Twitter messages

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (SAGE Publications & CILIP, 2015)

[Abstract]: The vast number of opinions and reviews provided in Twitter is helpful in order to make interesting findings about a given industry, but given the huge number of messages published every day, it is important ...

A review on political analysis and social media

Vilares, David; Alonso, Miguel A. (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016)

[Abstract] In democratic countries, forecasting the voting intentions of citizens and knowing their opinions on major political parties and leaders is of great interest to the parties themselves, to the media, and to the ...

A syntactic approach for opinion mining on Spanish reviews

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Cambridge University Press, 2015-01)

[Abstract]: We describe an opinion mining system which classifies the polarity of Spanish texts. We propose an NLP approach that undertakes pre-processing, tokenisation and POS tagging of texts to then obtain the syntactic ...

Absolute convergence and error thresholds in non-active adaptive sampling

Vilares Ferro, Manuel; Darriba Bilbao, Víctor M.; Vilares, Jesús (Elsevier Inc., 2022-05)

[Abstract] Non-active adaptive sampling is a way of building machine learning models from a training data base which are supposed to dynamically and automatically derive guaranteed sample size. In this context and regardless ...

Alternances actantielles et la montée du possesseur: une étude de cas en espagnol

Alonso-Ramos, Margarita (2009)

[Abstract] This article studies the syntactic realization of the possessors of a direct object as a syntactic dependent of the verb, that is, what is known as "possessor raising": besar a María en la frente ...

Una aproximación supervisada para la minería de opiniones sobre tuits en español en base a conocimiento lingüístico

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)

[Abstract]: This article describes a system for classifying the polarity of tweets written in Spanish. It adopts a hybrid approach that combines linguistic knowledge obtained through NLP with ...

Asignación de niveles de aprendizaje a las colocaciones del Diccionario de Colocaciones del español

García Salido, Marcos; Alonso-Ramos, Margarita (Pontificia Universidad Católica de Valparaíso. Instituto de Literatura y Ciencias del Lenguaje, 2018)

[Abstract] This article proposes a method for assigning levels to the collocations of the Diccionario de Colocaciones del Español according to the levels proposed in the CEFR. As a leveling criterion, ...

Bertinho: Galician BERT Representations

Vilares, David; García, Marcos; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2021-03)

[Abstract]: This paper presents a monolingual BERT model for Galician. We follow the recent trend that shows that it is feasible to build robust monolingual BERT models even for relatively low-resource languages, while ...

Building a New Sentiment Analysis Dataset for Uzbek Language and Creating Baseline Models

Kuriyozov, Elmurod; Matlatipov, Sanatbek (2019-08-02)

[Abstract] Making natural language processing technologies available for low-resource languages is an important goal to improve the access to technology in their communities of speakers. In this paper, we provide the first ...

Clasificación de polaridad en textos con opiniones en español mediante análisis sintáctico de dependencias

Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)

[Abstract]: This article describes an opinion mining system that classifies the polarity of Spanish texts. It proposes an NLP-based approach that involves segmentation, tokenization and ...

Comparing neural- and N-gram-based language models for word segmentation

Doval, Yerai; Gómez-Rodríguez, Carlos (John Wiley and Sons Inc., 2019-02)

[Abstract]: Word segmentation is the task of inserting or deleting word boundary characters in order to separate character sequences that correspond to words in some language. In this article we propose an approach based ...

Construcción de una lista de colocaciones para medir la competencia colocacional

Orol-González, Ana (Centro Virtual Cervantes, 2015)

[Abstract] The aim of this work is to create a list of Spanish collocations with assessment purpose. For the creation of this list we have followed a set of previously established criteria which are based on lists of frequent ...

Creación de un treebank de dependencias universales mediante recursos existentes para lenguas próximas: el caso del gallego

García, Marcos; Gómez-Rodríguez, Carlos; Alonso, Miguel A. (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016-09)

[Abstract] In this work we present a new strategy for creating treebanks for languages with few resources for syntactic parsing. The method consists of adapting and combining different annotated treebanks ...