Picture for Fan Shu

Fan Shu

DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

Add code
Feb 27, 2026
Viaarxiv icon