Picture for Cezara Petrui

Cezara Petrui

RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics

Add code
May 18, 2025
Viaarxiv icon