NFDI4DS | UHH-SEMS - Publication Details

Carlos Gómez‐Rodríguez

ORCID: 0000-0003-0752-8812

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5030874155

Research Areas

Natural Language Processing Techniques
Topic Modeling
Graph theory and applications
Advanced Graph Theory Research
Graph Theory and Algorithms
Text Readability and Simplification
Sentiment Analysis and Opinion Mining
Software Engineering Research
Semantic Web and Ontologies
Algorithms and Data Compression
Speech and dialogue systems
Advanced Text Analysis Techniques
semigroups and automata theory
Language and cultural evolution
Authorship Attribution and Profiling
Theoretical and Computational Physics
Speech Recognition and Synthesis
Multimodal Machine Learning Applications
Logic, programming, and type systems
Linguistic Variation and Morphology
Genomics and Phylogenetic Studies
Complex Network Analysis Techniques
Machine Learning in Bioinformatics
Biomedical Text Mining and Ontologies
Spanish Linguistics and Language Studies

Universidade da Coruña
2015-2024

Universidad de Guanajuato
2022-2024

University of the Sunshine Coast
2023

CITIC Group (China)
2019-2022

Universidade de Vigo
2006-2019

Laboratoire de Biotechnologie et Chimie Marines
1970

Sentiment Analysis for Fake News Detection

OPENALEX - Publications

Miguel Á. Alonso David Vilares Carlos Gómez‐Rodríguez Jesús Vilares

In recent years, we have witnessed a rise in fake news, i.e., provably false pieces of information created with the intention deception. The dissemination this type news poses serious threat to cohesion and social well-being, since it fosters political polarization distrust people respect their leaders. huge amount that is disseminated through media makes manual verification unfeasible, which has promoted design implementation automatic systems for detection. creators use various stylistic...

10.3390/electronics10111348 article EN Electronics 2021-06-05

Supervised sentiment analysis in multilingual environments

OPENALEX - Publications

David Vilares Miguel Á. Alonso Carlos Gómez‐Rodríguez

10.1016/j.ipm.2017.01.004 article EN Information Processing & Management 2017-03-01

Constituent Parsing as Sequence Labeling

OPENALEX - Publications

Carlos Gómez‐Rodríguez David Vilares

We introduce a method to reduce constituent parsing sequence labeling. For each word wt, it generates label that encodes: (1) the number of ancestors in tree words wt and wt+1 have common, (2) nonterminal symbol at lowest common ancestor. first prove proposed encoding function is injective for any without unary branches. In practice, approach made extensible all constituency trees by collapsing then use PTB CTB treebanks as testbeds propose set fast baselines. achieve 90% F-score on test...

10.18653/v1/d18-1162 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2018-01-01

Viable Dependency Parsing as Sequence Labeling

OPENALEX - Publications

Michalina Strzyz David Vilares Carlos Gómez‐Rodríguez

Michalina Strzyz, David Vilares, Carlos Gómez-Rodríguez. Proceedings of the 2019 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2019.

10.18653/v1/n19-1077 article EN 2019-01-01

Optimality of syntactic dependency distances

OPENALEX - Publications

Ramon Ferrer‐i‐Cancho Carlos Gómez‐Rodríguez Juan Luis Esteban Lluís Alemany-Puig

It is often stated that human languages, as other biological systems, are shaped by cost-cutting pressures but, to what extent? Attempts quantify the degree of optimality languages means an score have been scarce and focused mostly on English. Here we recast problem word order a sentence optimization spatial network where vertices words, arcs indicate syntactic dependencies, space defined linear words in sentence. We introduce cognitive pressure reduce distance between linked The analysis...

10.1103/physreve.105.014308 article EN Physical review. E 2022-01-18

Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora

OPENALEX - Publications

David Vilares Miguel Á. Alonso Carlos Gómez‐Rodríguez

We address the problem of performing polarity classification on Twitter over different languages, focusing English and Spanish, comparing three techniques: (1) a monolingual model which knows language in opinion is written, (2) that acts based decision provided by identification tool (3) multilingual trained dataset does not need any recognition step.Results show models are even able to outperform some sets.We introduce first code-switching corpus with sentiment labels, showing robustness approach.

10.18653/v1/w15-2902 article EN cc-by 2015-01-01

A syntactic approach for opinion mining on Spanish reviews

OPENALEX - Publications

David Vilares Miguel Á. Alonso Carlos Gómez‐Rodríguez

Abstract We describe an opinion mining system which classifies the polarity of Spanish texts. propose NLP approach that undertakes pre-processing, tokenisation and POS tagging texts to then obtain syntactic structure sentences by means a dependency parser. This is used address three most significant linguistic constructions for purpose in question: intensification, subordinate adversative clauses negation. also semi-automatic domain adaptation method improve accuracy our specific application...

10.1017/s1351324913000181 article EN Natural Language Engineering 2013-08-09

Universal, unsupervised (rule-based), uncovered sentiment analysis

OPENALEX - Publications

David Vilares Carlos Gómez‐Rodríguez Miguel Á. Alonso

10.1016/j.knosys.2016.11.014 article EN Knowledge-Based Systems 2016-11-23

On the usefulness of lexical and syntactic processing in polarity classification of Twitter messages

OPENALEX - Publications

David Vilares Miguel Á. Alonso Carlos Gómez‐Rodríguez

Millions of micro texts are published every day on T witter. Identifying the sentiment present in them can be helpful for measuring frame mind public, their satisfaction with respect to a product, or support social event. In this context, polarity classification is subfield analysis focused determining whether content text objective subjective, and latter case, if it conveys positive negative opinion. Most detection techniques tend take into account individual terms even some degree...

10.1002/asi.23284 article EN Journal of the Association for Information Science and Technology 2015-04-29

Left-to-Right Dependency Parsing with Pointer Networks

OPENALEX - Publications

Daniel Fernández‐González Carlos Gómez‐Rodríguez

Daniel Fernández-González, Carlos Gómez-Rodríguez. Proceedings of the 2019 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2019.

10.18653/v1/n19-1076 article EN 2019-01-01

A Confederacy of Models: a Comprehensive Evaluation of LLMs on Creative Writing

OPENALEX - Publications

Carlos Gómez‐Rodríguez Paul Williams

We evaluate a range of recent LLMs on English creative writing, challenging and complex task that requires imagination, coherence, style. use difficult, open-ended scenario chosen to avoid training data reuse: an epic narration single combat between Ignatius J. Reilly, the protagonist Pulitzer Prize-winning novel A Confederacy Dunces (1980), pterodactyl, prehistoric flying reptile. ask several humans write such story conduct human evalution involving various criteria as fluency, originality,...

10.18653/v1/2023.findings-emnlp.966 article EN cc-by 2023-01-01

Contrasting Linguistic Patterns in Human and LLM-Generated News Text

OPENALEX - Publications

Alberto Muñoz-Ortiz Carlos Gómez‐Rodríguez David Vilares

<title>Abstract</title> We conduct a quantitative analysis contrasting human-written English news text with comparable large language model (LLM) output from six different LLMs that cover three families and four sizes in total. Our spans several measurable linguistic dimensions, including morphological, syntactic, psychometric, sociolinguistic aspects. The results reveal various differences between human AI-generated texts. Human texts exhibit more scattered sentence length distributions,...

10.21203/rs.3.rs-4077382/v1 preprint EN cc-by Research Square (Research Square) 2024-03-14

El empowerment en desempeño del personal de mantenimiento en exploración y producción en unidad petrolera

OPENALEX - Publications

Carlos Gómez‐Rodríguez

En el contexto de las industrias del petróleo y gas en tabasco, Mexico,en lo relativo al Empowerrment, se hace relevante determinar la relación entre organizaciones que son algo más simple suma partes un sistema actividades formadas por dos o personas –capital humano–, donde es esencial cooperación ellas para puedan existir, existe estrategia empresarial Empowerrment (en español: empoderamiento), consiste delegar autoridad responsabilidad a los individuos., dar poder, tareas “líder “a...

10.59169/pentaciencias.v17i2.1449 article ES cc-by-nc-sa Revista Científica Arbitrada Multidisciplinaria PENTACIENCIAS 2025-02-21

Divisible Transition Systems and Multiplanar Dependency Parsing

OPENALEX - Publications

Carlos Gómez‐Rodríguez Joakim Nivre

Transition-based parsing is a widely used approach for dependency that combines high efficiency with expressive feature models. Many different transition systems have been proposed, often formalized in slightly frameworks. In this article, we show large number of the known projective can be viewed as variants same stack-based system small set elementary transitions composed into complex and restricted ways. We call these divisible prove theoretical results about their expressivity...

10.1162/coli_a_00150 article EN Computational Linguistics 2013-01-03

One model, two languages: training bilingual parsers with harmonized treebanks

OPENALEX - Publications

David Vilares Carlos Gómez‐Rodríguez Miguel Á. Alonso

We introduce an approach to train lexicalized parsers using bilingual corpora obtained by merging harmonized treebanks of different languages, producing that can analyze sentences in either the learned or even mix both. test on Universal Dependency Treebanks, training with MaltParser and MaltOptimizer. The results show these are more than competitive, as most combinations not only preserve accuracy, but some achieve significant improvements over corresponding monolingual parsers. Preliminary...

10.18653/v1/p16-2069 article EN 2016-01-01

HEAD-QA: A Healthcare Dataset for Complex Reasoning

OPENALEX - Publications

David Vilares Carlos Gómez‐Rodríguez

We present HEAD-QA, a multi-choice question answering testbed to encourage research on complex reasoning. The questions come from exams access specialized position in the Spanish healthcare system, and are challenging even for highly humans. then consider monolingual (Spanish) cross-lingual (to English) experiments with information retrieval neural techniques. show that: (i) HEAD-QA challenges current methods, (ii) results lag well behind human performance, demonstrating its usefulness as...

10.18653/v1/p19-1092 article EN cc-by 2019-01-01

Parsing as Pretraining

OPENALEX - Publications

David Vilares Michalina Strzyz Anders Søgaard Carlos Gómez‐Rodríguez

Recent analyses suggest that encoders pretrained for language modeling capture certain morpho-syntactic structure. However, probing frameworks word vectors still do not report results on standard setups such as constituent and dependency parsing. This paper addresses this problem does full parsing (on English) relying only pretraining architectures – no decoding. We first cast sequence tagging. then use a single feed-forward layer to directly map labels encode linearized tree. is used to:...

10.1609/aaai.v34i05.6446 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Dependency Parsing Schemata and Mildly Non-Projective Dependency Parsing

OPENALEX - Publications

Carlos Gómez‐Rodríguez John M. Carroll David Weir

We introduce dependency parsing schemata, a formal framework based on Sikkel's schemata for constituency parsers, which can be used to describe, analyze, and compare algorithms. use this describe several well-known projective non-projective build correctness proofs, establish relationships between them. then the define new polynomial-time algorithms various mildly formalisms, including well-nested structures with their gap degree bounded by constant k in time O(n 5+2k ), class that includes...

10.1162/coli_a_00060 article EN cc-by-nc-nd Computational Linguistics 2011-03-24

A Polynomial-Time Dynamic Oracle for Non-Projective Dependency Parsing

OPENALEX - Publications

Carlos Gómez‐Rodríguez Francesco Sartorio Giorgio Satta

The introduction of dynamic oracles has considerably improved the accuracy greedy transition-based dependency parsers, without sacrificing parsing efficiency.However, this enhancement is limited to projective parsing, and have not yet been implemented for parsers supporting non-projectivity.In paper we introduce first such oracle, a non-projective parser based on Attardi's parser.We show that training with oracle improves over conventional (static) wide range datasets.

10.3115/v1/d14-1099 article EN cc-by 2014-01-01

Coming Soon ...