Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation

May 29, 2025

Beiduo Chen, Yang Janet Liu, Anna Korhonen, Barbara Plank

Figure 1 for Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation

Figure 2 for Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation

Figure 3 for Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation

Figure 4 for Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation

Share this with someone who'll enjoy it:

Abstract:The recent rise of reasoning-tuned Large Language Models (LLMs)--which generate chains of thought (CoTs) before giving the final answer--has attracted significant attention and offers new opportunities for gaining insights into human label variation, which refers to plausible differences in how multiple annotators label the same data instance. Prior work has shown that LLM-generated explanations can help align model predictions with human label distributions, but typically adopt a reverse paradigm: producing explanations based on given answers. In contrast, CoTs provide a forward reasoning path that may implicitly embed rationales for each answer option, before generating the answers. We thus propose a novel LLM-based pipeline enriched with linguistically-grounded discourse segmenters to extract supporting and opposing statements for each answer option from CoTs with improved accuracy. We also propose a rank-based HLV evaluation framework that prioritizes the ranking of answers over exact scores, which instead favor direct comparison of label distributions. Our method outperforms a direct generation method as well as baselines on three datasets, and shows better alignment of ranking methods with humans, highlighting the effectiveness of our approach.

* 22 pages, 7 figures

View paper on

Share this with someone who'll enjoy it:

Title:Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation

Paper and Code