Derek Li

Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning

Sep 18, 2025
Via arXiv

Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs

Jul 02, 2025
Via arXiv