Picture for Bryan Dai

Bryan Dai

Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning

Add code
Sep 18, 2025
Viaarxiv icon

One-shot Entropy Minimization

Add code
May 27, 2025
Viaarxiv icon

REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning

Add code
May 27, 2025
Viaarxiv icon

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Add code
Feb 20, 2025
Viaarxiv icon