Ivo Petrov

BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs

Oct 06, 2025
Via arXiv

MathArena: Evaluating LLMs on Uncontaminated Math Competitions

May 29, 2025
Via arXiv

Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad

Mar 27, 2025
Via arXiv

GRAIN: Exact Graph Reconstruction from Gradients

Mar 03, 2025
Via arXiv

DAGER: Exact Gradient Inversion for Large Language Models

May 24, 2024
Via arXiv