Picture for Wenkai Fang

Wenkai Fang

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Add code
Aug 23, 2025
Viaarxiv icon

SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data

Add code
May 25, 2025
Viaarxiv icon

Reasoning with Reinforced Functional Token Tuning

Add code
Feb 19, 2025
Viaarxiv icon

Odyssey: Empowering Agents with Open-World Skills

Add code
Jul 22, 2024
Figure 1 for Odyssey: Empowering Agents with Open-World Skills
Figure 2 for Odyssey: Empowering Agents with Open-World Skills
Figure 3 for Odyssey: Empowering Agents with Open-World Skills
Figure 4 for Odyssey: Empowering Agents with Open-World Skills
Viaarxiv icon