Picture for Cole Granger

Cole Granger

Tricky$^2$: Towards a Benchmark for Evaluating Human and LLM Error Interactions

Add code
Jan 26, 2026
Viaarxiv icon