Picture for Li Yu-Jhe

Li Yu-Jhe

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Add code
Sep 26, 2025
Viaarxiv icon