Barbara Plank

Information Asymmetry across Language Varieties: A Case Study on Cantonese-Mandarin and Bavarian-German QA

Mar 16, 2026

Indirect Question Answering in English, German and Bavarian: A Challenging Task for High- and Low-Resource Languages Alike

Mar 16, 2026

LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models

Feb 06, 2026

When Meanings Meet: Investigating the Emergence and Quality of Shared Concept Spaces during Multilingual Language Model Training

Jan 30, 2026

Controlling Reading Ease with Gaze-Guided Text Generation

Jan 25, 2026

SteerEval: Inference-time Interventions Strengthen Multilingual Generalization in Neural Summarization Metrics

Jan 22, 2026

Decoupling the Effect of Chain-of-Thought Reasoning: A Human Label Variation Perspective

Jan 06, 2026

Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration

Jan 06, 2026

EVADE: LLM-Based Explanation Generation and Validation for Error Detection in NLI

Nov 12, 2025

BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)

Oct 14, 2025