Zeyu Gan

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents

Oct 16, 2025
Via arXiv

CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning

Sep 04, 2025
Via arXiv

Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning

Jan 26, 2025
Via arXiv