Bryan Dai

Context as a Tool: Context Management for Long-Horizon SWE-Agents

Dec 26, 2025

Universal Reasoning Model

Dec 24, 2025

Scaling Laws for Code: Every Programming Language Matters

Dec 15, 2025

Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning

Sep 18, 2025

One-shot Entropy Minimization

May 27, 2025

REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning

May 27, 2025

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Feb 20, 2025