Picture for Xinghao Zhao

Xinghao Zhao

IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck

Add code
Jan 09, 2026
Viaarxiv icon