Metaxas Dimitris

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Sep 26, 2025
Via arXiv