Crish Nagarkar

Can LLM Reasoning Be Trusted? A Comparative Study: Using Human Benchmarking on Statistical Tasks

Jan 20, 2026
Via arXiv