Erik Y. Wang

AdaBoost Does Not Always Cycle: A Computer-Assisted Counterexample

Apr 08, 2026

HorizonMath: Measuring AI Progress Toward Mathematical Discovery with Automatic Verification

Mar 16, 2026

HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate Class

May 17, 2025

Humanity's Last Exam

Jan 24, 2025

HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics

Oct 13, 2024