Picture for Wenkai Fang

Wenkai Fang

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Add code
Aug 23, 2025
Viaarxiv icon

SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data

Add code
May 25, 2025
Figure 1 for SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
Figure 2 for SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
Figure 3 for SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
Figure 4 for SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
Viaarxiv icon

Reasoning with Reinforced Functional Token Tuning

Add code
Feb 19, 2025
Figure 1 for Reasoning with Reinforced Functional Token Tuning
Figure 2 for Reasoning with Reinforced Functional Token Tuning
Figure 3 for Reasoning with Reinforced Functional Token Tuning
Figure 4 for Reasoning with Reinforced Functional Token Tuning
Viaarxiv icon

Odyssey: Empowering Agents with Open-World Skills

Add code
Jul 22, 2024
Figure 1 for Odyssey: Empowering Agents with Open-World Skills
Figure 2 for Odyssey: Empowering Agents with Open-World Skills
Figure 3 for Odyssey: Empowering Agents with Open-World Skills
Figure 4 for Odyssey: Empowering Agents with Open-World Skills
Viaarxiv icon