Anna Korhonen

Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models

Jun 06, 2025
Via arXiv

Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation

May 29, 2025
Via arXiv

Visual Planning: Let's Think Only with Images

May 16, 2025
Via arXiv

Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies

Feb 04, 2025
Via arXiv

A Rose by Any Other Name: LLM-Generated Explanations Are Good Proxies for Human Explanations to Collect Label Distributions on NLI

Dec 18, 2024
Via arXiv

How far can bias go? -- Tracing bias from pretraining data to alignment

Nov 28, 2024
Via arXiv

SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists

Aug 30, 2024
Via arXiv

Can Rule-Based Insights Enhance LLMs for Radiology Report Classification? Introducing the RadPrompt Methodology

Aug 07, 2024
Via arXiv

TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish

Jul 17, 2024
Via arXiv

"Seeing the Big through the Small": Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?

Jun 25, 2024
Via arXiv