policy gradient


Unifying Model Predictive Path Integral Control, Reinforcement Learning, and Diffusion Models for Optimal Control and Planning

Feb 27, 2025

Efficient and Optimal Policy Gradient Algorithm for Corrupted Multi-armed Bandits

Feb 19, 2025

Accelerating Model-Based Reinforcement Learning with State-Space World Models

Feb 27, 2025

Neural Combinatorial Optimization via Preference Optimization

Mar 10, 2025

Reevaluating Policy Gradient Methods for Imperfect-Information Games

Feb 13, 2025

Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application

Feb 27, 2025

REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective

Feb 24, 2025

A Multi-Agent DRL-Based Framework for Optimal Resource Allocation and Twin Migration in the Multi-Tier Vehicular Metaverse

Feb 26, 2025

MoE-Loco: Mixture of Experts for Multitask Locomotion

Mar 11, 2025

A Reinforcement Learning Approach to Non-prehensile Manipulation through Sliding

Feb 24, 2025