Yijing Wang

Boosting Maximum Entropy Reinforcement Learning via One-Step Flow Matching

Feb 02, 2026
Via arXiv