ALFWorld


RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents

Jul 30, 2025

CoEx -- Co-evolving World-model and Exploration

Jul 29, 2025

Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search

Jun 10, 2025

Enhancing Decision-Making of Large Language Models via Actor-Critic

Jun 04, 2025

SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution

May 27, 2025

Agent-Environment Alignment via Automated Interface Generation

May 27, 2025

Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning

May 26, 2025

EMAC+: Embodied Multimodal Agent for Collaborative Planning with VLM+LLM

May 26, 2025

Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking

May 26, 2025

ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection

May 21, 2025