Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

David Vilares

HEAD-QA v2: Expanding a Healthcare Benchmark for Reasoning

Nov 19, 2025

Alexis Correa-Guillén, Carlos Gómez-Rodríguez, David Vilares

Abstract:We introduce HEAD-QA v2, an expanded and updated version of a Spanish/English healthcare multiple-choice reasoning dataset originally released by Vilares and Gómez-Rodríguez (2019). The update responds to the growing need for high-quality datasets that capture the linguistic and conceptual complexity of healthcare reasoning. We extend the dataset to over 12,000 questions from ten years of Spanish professional exams, benchmark several open-source LLMs using prompting, RAG, and probability-based answer selection, and provide additional multilingual versions to support future work. Results indicate that performance is mainly driven by model scale and intrinsic reasoning ability, with complex inference strategies obtaining limited gains. Together, these results establish HEAD-QA v2 as a reliable resource for advancing research on biomedical reasoning and model improvement.

* Preprint. 12 pages

Via

Access Paper or Ask Questions

Hierarchical Bracketing Encodings Work for Dependency Graphs

Sep 11, 2025

Ana Ezquerro, Carlos Gómez-Rodríguez, David Vilares

Abstract:We revisit hierarchical bracketing encodings from a practical perspective in the context of dependency graph parsing. The approach encodes graphs as sequences, enabling linear-time parsing with $n$ tagging actions, and still representing reentrancies, cycles, and empty nodes. Compared to existing graph linearizations, this representation substantially reduces the label space while preserving structural information. We evaluate it on a multilingual and multi-formalism benchmark, showing competitive results and consistent improvements over other methods in exact match accuracy.

* Accepted at EMNLP 2025 (main)

Via

Access Paper or Ask Questions

Nested Named Entity Recognition as Single-Pass Sequence Labeling

May 22, 2025

Alberto Muñoz-Ortiz, David Vilares, Caio COrro, Carlos Gómez-Rodríguez

Abstract:We cast nested named entity recognition (NNER) as a sequence labeling task by leveraging prior work that linearizes constituency structures, effectively reducing the complexity of this structured prediction problem to straightforward token classification. By combining these constituency linearizations with pretrained encoders, our method captures nested entities while performing exactly $n$ tagging actions. Our approach achieves competitive performance compared to less efficient systems, and it can be trained using any off-the-shelf sequence labeling library.

* Submitted to EMNLP 2025

Via

Access Paper or Ask Questions

Hierarchical Bracketing Encodings for Dependency Parsing as Tagging

May 16, 2025

Ana Ezquerro, David Vilares, Anssi Yli-Jyrä, Carlos Gómez-Rodríguez

Abstract:We present a family of encodings for sequence labeling dependency parsing, based on the concept of hierarchical bracketing. We prove that the existing 4-bit projective encoding belongs to this family, but it is suboptimal in the number of labels used to encode a tree. We derive an optimal hierarchical bracketing, which minimizes the number of symbols used and encodes projective trees using only 12 distinct labels (vs. 16 for the 4-bit encoding). We also extend optimal hierarchical bracketing to support arbitrary non-projectivity in a more compact way than previous encodings. Our new encodings yield competitive accuracy on a diverse set of treebanks.

* Accepted to ACL 2025. Original submission; camera-ready coming soon

Via

Access Paper or Ask Questions

Better Benchmarking LLMs for Zero-Shot Dependency Parsing

Feb 28, 2025

Ana Ezquerro, Carlos Gómez-Rodríguez, David Vilares

Abstract:While LLMs excel in zero-shot tasks, their performance in linguistic challenges like syntactic parsing has been less scrutinized. This paper studies state-of-the-art open-weight LLMs on the task by comparing them to baselines that do not have access to the input sentence, including baselines that have not been used in this context such as random projective trees or optimal linear arrangements. The results show that most of the tested LLMs cannot outperform the best uninformed baselines, with only the newest and largest versions of LLaMA doing so for most languages, and still achieving rather low performance. Thus, accurate zero-shot syntactic parsing is not forthcoming with open LLMs.

* Accepted at NoDaLiDa/Baltic-HLT 2025

Via

Access Paper or Ask Questions

Dependency Graph Parsing as Sequence Labeling

Oct 23, 2024

Ana Ezquerro, David Vilares, Carlos Gómez-Rodríguez

Figure 1 for Dependency Graph Parsing as Sequence Labeling

Figure 2 for Dependency Graph Parsing as Sequence Labeling

Figure 3 for Dependency Graph Parsing as Sequence Labeling

Figure 4 for Dependency Graph Parsing as Sequence Labeling

Abstract:Various linearizations have been proposed to cast syntactic dependency parsing as sequence labeling. However, these approaches do not support more complex graph-based representations, such as semantic dependencies or enhanced universal dependencies, as they cannot handle reentrancy or cycles. By extending them, we define a range of unbounded and bounded linearizations that can be used to cast graph parsing as a tagging task, enlarging the toolbox of problems that can be solved under this paradigm. Experimental results on semantic dependency and enhanced UD parsing show that with a good choice of encoding, sequence-labeling dependency graph parsers combine high efficiency with accuracies close to the state of the art, in spite of their simplicity.

* Accepted at EMNLP-2024

Via

Access Paper or Ask Questions

Dancing in the syntax forest: fast, accurate and explainable sentiment analysis with SALSA

Jun 23, 2024

Carlos Gómez-Rodríguez, Muhammad Imran, David Vilares, Elena Solera, Olga Kellert

Abstract:Sentiment analysis is a key technology for companies and institutions to gauge public opinion on products, services or events. However, for large-scale sentiment analysis to be accessible to entities with modest computational resources, it needs to be performed in a resource-efficient way. While some efficient sentiment analysis systems exist, they tend to apply shallow heuristics, which do not take into account syntactic phenomena that can radically change sentiment. Conversely, alternatives that take syntax into account are computationally expensive. The SALSA project, funded by the European Research Council under a Proof-of-Concept Grant, aims to leverage recently-developed fast syntactic parsing techniques to build sentiment analysis systems that are lightweight and efficient, while still providing accuracy and explainability through the explicit use of syntax. We intend our approaches to be the backbone of a working product of interest for SMEs to use in production.

* Accepted for publication at SEPLN-CEDI2024: Seminar of the Spanish Society for Natural Language Processing at the 7th Spanish Conference on Informatics

Via

Access Paper or Ask Questions

LyS at SemEval-2024 Task 3: An Early Prototype for End-to-End Multimodal Emotion Linking as Graph-Based Parsing

May 10, 2024

Ana Ezquerro, David Vilares

Abstract:This paper describes our participation in SemEval 2024 Task 3, which focused on Multimodal Emotion Cause Analysis in Conversations. We developed an early prototype for an end-to-end system that uses graph-based methods from dependency parsing to identify causal emotion relations in multi-party conversations. Our model comprises a neural transformer-based encoder for contextualizing multimodal conversation data and a graph-based decoder for generating the adjacency matrix scores of the causal graph. We ranked 7th out of 15 valid and official submissions for Subtask 1, using textual inputs only. We also discuss our participation in Subtask 2 during post-evaluation using multi-modal inputs.

* Accepted at SemEval 2024

Via

Access Paper or Ask Questions

From Partial to Strictly Incremental Constituent Parsing

Feb 05, 2024

Ana Ezquerro, Carlos Gómez-Rodríguez, David Vilares

Figure 1 for From Partial to Strictly Incremental Constituent Parsing

Figure 2 for From Partial to Strictly Incremental Constituent Parsing

Figure 3 for From Partial to Strictly Incremental Constituent Parsing

Figure 4 for From Partial to Strictly Incremental Constituent Parsing

Abstract:We study incremental constituent parsers to assess their capacity to output trees based on prefix representations alone. Guided by strictly left-to-right generative language models and tree-decoding modules, we build parsers that adhere to a strong definition of incrementality across languages. This builds upon work that asserted incrementality, but that mostly only enforced it on either the encoder or the decoder. Finally, we conduct an analysis against non-incremental and partially incremental models.

* Accepted at EACL 2024

Via

Access Paper or Ask Questions

4 and 7-bit Labeling for Projective and Non-Projective Dependency Trees

Oct 22, 2023

Carlos Gómez-Rodríguez, Diego Roca, David Vilares

Figure 1 for 4 and 7-bit Labeling for Projective and Non-Projective Dependency Trees

Figure 2 for 4 and 7-bit Labeling for Projective and Non-Projective Dependency Trees

Figure 3 for 4 and 7-bit Labeling for Projective and Non-Projective Dependency Trees

Figure 4 for 4 and 7-bit Labeling for Projective and Non-Projective Dependency Trees

Abstract:We introduce an encoding for parsing as sequence labeling that can represent any projective dependency tree as a sequence of 4-bit labels, one per word. The bits in each word's label represent (1) whether it is a right or left dependent, (2) whether it is the outermost (left/right) dependent of its parent, (3) whether it has any left children and (4) whether it has any right children. We show that this provides an injective mapping from trees to labels that can be encoded and decoded in linear time. We then define a 7-bit extension that represents an extra plane of arcs, extending the coverage to almost full non-projectivity (over 99.9% empirical arc coverage). Results on a set of diverse treebanks show that our 7-bit encoding obtains substantial accuracy gains over the previously best-performing sequence labeling encodings.

* Accepted for publication at EMNLP 2023

Via

Access Paper or Ask Questions