Mrinmaya Sachan

When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation

Feb 18, 2026
Via arXiv

Fluid Representations in Reasoning Models

Feb 04, 2026
Via arXiv

Uncovering Hidden Correctness in LLM Causal Reasoning via Symbolic Verification

Jan 29, 2026
Via arXiv

PATS: Personality-Aware Teaching Strategies with Large Language Model Tutors

Jan 13, 2026
Via arXiv

Don't Throw Away Your Beams: Improving Consistency-based Uncertainties in LLMs via Beam Search

Dec 10, 2025
Via arXiv

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

Nov 11, 2025
Via arXiv

Chimera: Diagnosing Shortcut Learning in Visual-Language Understanding

Sep 26, 2025
Via arXiv

Can Vision-Language Models Solve Visual Math Equations?

Sep 10, 2025
Via arXiv

Probing for Arithmetic Errors in Language Models

Jul 16, 2025
Via arXiv

The Medium Is Not the Message: Deconfounding Text Embeddings via Linear Concept Erasure

Jul 01, 2025
Via arXiv