Picture for Miriam Wanner

Miriam Wanner

Can Reasoning Models Detect Changes to their Chains of Thought?

Add code
Jun 20, 2026
Viaarxiv icon

Weird Generalization is Weirdly Brittle

Add code
Apr 11, 2026
Viaarxiv icon

All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations

Add code
Oct 08, 2025
Figure 1 for All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations
Figure 2 for All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations
Figure 3 for All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations
Figure 4 for All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations
Viaarxiv icon

Does Local News Stay Local?: Online Content Shifts in Sinclair-Acquired Stations

Add code
Oct 08, 2025
Viaarxiv icon

How Grounded is Wikipedia? A Study on Structured Evidential Support

Add code
Jun 14, 2025
Viaarxiv icon

CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?

Add code
Mar 27, 2025
Figure 1 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Figure 2 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Figure 3 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Figure 4 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Viaarxiv icon

DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation

Add code
Dec 17, 2024
Figure 1 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Figure 2 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Figure 3 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Figure 4 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Viaarxiv icon

Core: Robust Factual Precision Scoring with Informative Sub-Claim Identification

Add code
Jul 04, 2024
Figure 1 for Core: Robust Factual Precision Scoring with Informative Sub-Claim Identification
Figure 2 for Core: Robust Factual Precision Scoring with Informative Sub-Claim Identification
Figure 3 for Core: Robust Factual Precision Scoring with Informative Sub-Claim Identification
Figure 4 for Core: Robust Factual Precision Scoring with Informative Sub-Claim Identification
Viaarxiv icon

A Closer Look at Claim Decomposition

Add code
Mar 18, 2024
Figure 1 for A Closer Look at Claim Decomposition
Figure 2 for A Closer Look at Claim Decomposition
Figure 3 for A Closer Look at Claim Decomposition
Figure 4 for A Closer Look at Claim Decomposition
Viaarxiv icon

Revisiting the Effects of Leakage on Dependency Parsing

Add code
Mar 24, 2022
Figure 1 for Revisiting the Effects of Leakage on Dependency Parsing
Figure 2 for Revisiting the Effects of Leakage on Dependency Parsing
Figure 3 for Revisiting the Effects of Leakage on Dependency Parsing
Figure 4 for Revisiting the Effects of Leakage on Dependency Parsing
Viaarxiv icon