Picture for Shaoqin Zhu

Shaoqin Zhu

ISEP: Implicit Support Expansion for Offline Reinforcement Learning via Stochastic Policy Optimization

Add code
May 18, 2026
Viaarxiv icon