Zongkai Liu

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

May 19, 2025
Via arXiv

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

May 18, 2025
Via arXiv

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Mar 10, 2025
Via arXiv

Rapid Learning in Constrained Minimax Games with Negative Momentum

Dec 31, 2024
Via arXiv

An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning

Sep 16, 2024
Via arXiv

Policy-regularized Offline Multi-objective Reinforcement Learning

Jan 04, 2024
Via arXiv