Imitation Learning


Imitation learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

Hybrid Consistency Policy: Decoupling Multi-Modal Diversity and Real-Time Efficiency in Robotic Manipulation

Add code
Oct 30, 2025
Viaarxiv icon

Learning to Manage Investment Portfolios beyond Simple Utility Functions

Add code
Oct 30, 2025
Viaarxiv icon

Beyond Imitation: Constraint-Aware Trajectory Generation with Flow Matching For End-to-End Autonomous Driving

Add code
Oct 30, 2025
Viaarxiv icon

Human-in-the-loop Online Rejection Sampling for Robotic Manipulation

Add code
Oct 30, 2025
Viaarxiv icon

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Add code
Oct 29, 2025
Viaarxiv icon

ManiDP: Manipulability-Aware Diffusion Policy for Posture-Dependent Bimanual Manipulation

Add code
Oct 27, 2025
Viaarxiv icon

PORTool: Tool-Use LLM Training with Rewarded Tree

Add code
Oct 29, 2025
Viaarxiv icon

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning

Add code
Oct 29, 2025
Viaarxiv icon

FastVLM: Self-Speculative Decoding for Fast Vision-Language Model Inference

Add code
Oct 26, 2025
Viaarxiv icon

Edge Collaborative Gaussian Splatting with Integrated Rendering and Communication

Add code
Oct 26, 2025
Viaarxiv icon