Picture for Wenxin E

Wenxin E

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

Add code
Oct 10, 2025
Viaarxiv icon