Abstract: This paper introduces the first multi-lingual and multi-label classification model for implicit discourse relation recognition (IDRR). Our model, HArch, is evaluated on the recently released DiscoGeM 2.0 corpus and leverages hierarchical dependencies between discourse senses to predict probability distributions across all three sense levels in the PDTB 3.0 framework. We compare several pre-trained encoder backbones and find that RoBERTa-HArch achieves the best performance in English, while XLM-RoBERTa-HArch performs best in the multi-lingual setting. In addition, we compare our fine-tuned models against GPT-4o and Llama-4-Maverick using few-shot prompting across all language configurations. Our results show that our fine-tuned models consistently outperform these LLMs, highlighting the advantages of task-specific fine-tuning over prompting in IDRR. Finally, we report SOTA results on the DiscoGeM 1.0 corpus, further validating the effectiveness of our hierarchical approach.
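To make the hierarchical prediction concrete, the following is a minimal sketch of a classification head that produces a sense distribution at each of the three PDTB 3.0 levels, with each level conditioned on the previous one. The class name, layer sizes, and per-level sense counts are illustrative assumptions, not the actual HArch configuration.

```python
# Hedged sketch of a hierarchical multi-label sense head (PyTorch).
# Hidden size and sense counts per level are placeholder values.
import torch
import torch.nn as nn


class HierarchicalSenseHead(nn.Module):
    """Predicts a probability distribution over senses at each of the
    three PDTB 3.0 levels, feeding each level's output into the next."""

    def __init__(self, hidden_size=768, n_l1=4, n_l2=17, n_l3=28):
        super().__init__()
        self.l1 = nn.Linear(hidden_size, n_l1)
        # Level-2 logits see the encoder state plus the level-1 distribution;
        # level-3 logits see the encoder state plus the level-2 distribution.
        self.l2 = nn.Linear(hidden_size + n_l1, n_l2)
        self.l3 = nn.Linear(hidden_size + n_l2, n_l3)

    def forward(self, pooled):
        # pooled: (batch, hidden_size) sentence-pair representation
        p1 = torch.softmax(self.l1(pooled), dim=-1)
        p2 = torch.softmax(self.l2(torch.cat([pooled, p1], dim=-1)), dim=-1)
        p3 = torch.softmax(self.l3(torch.cat([pooled, p2], dim=-1)), dim=-1)
        return p1, p2, p3  # soft sense distributions per level
```

The distributions can then be trained against the annotator label distributions in DiscoGeM, e.g. with a KL-divergence loss per level.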
Abstract: In this work, we address the inherent ambiguity in Implicit Discourse Relation Recognition (IDRR) by introducing a novel multi-task classification model capable of learning both multi-label and single-label representations of discourse relations. Leveraging the DiscoGeM corpus, we train and evaluate our model on both multi-label and traditional single-label classification tasks. To the best of our knowledge, our work presents the first truly multi-label classifier in IDRR, establishing a benchmark for multi-label classification and achieving SOTA results in single-label classification on DiscoGeM. Additionally, we evaluate our model on the PDTB 3.0 corpus for single-label classification without any prior exposure to its data. While the performance is below the current SOTA, our model demonstrates promising results, indicating potential for effective transfer learning across both corpora.
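As an illustration of the multi-task setup, the sketch below combines a multi-label loss against the annotator sense distribution with a single-label loss against the majority sense. The function name, the KL-divergence choice for the multi-label term, and the weighting scheme are our assumptions for exposition, not the model's documented objective.

```python
# Hedged sketch of a joint multi-label / single-label objective.
import torch
import torch.nn.functional as F


def multi_task_loss(logits, soft_targets, hard_targets, alpha=0.5):
    """Combine a multi-label term (soft sense distribution) with a
    single-label term (majority sense), weighted by alpha."""
    log_probs = F.log_softmax(logits, dim=-1)
    # Multi-label term: KL divergence to the annotator distribution.
    multi = F.kl_div(log_probs, soft_targets, reduction="batchmean")
    # Single-label term: cross-entropy on the majority label.
    single = F.cross_entropy(logits, hard_targets)
    return alpha * multi + (1 - alpha) * single
```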