Kongcheng Zhang

MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models

Sep 18, 2025, via arXiv

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Aug 23, 2025, via arXiv

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning

Jun 10, 2025, via arXiv

SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data

May 25, 2025, via arXiv

Reasoning with Reinforced Functional Token Tuning

Feb 19, 2025, via arXiv

Odyssey: Empowering Agents with Open-World Skills

Jul 22, 2024, via arXiv