Picture for Zhenhan Bai

Zhenhan Bai

Deep Research as Rubric for Reinforcement Learning

Add code
May 31, 2026
Viaarxiv icon