Picture for Perry Dong

Perry Dong

Reinforcement Learning via Implicit Imitation Guidance

Add code
Jun 09, 2025
Viaarxiv icon

What Matters for Batch Online Reinforcement Learning in Robotics?

Add code
May 12, 2025
Viaarxiv icon

Adaptively Learning to Select-Rank in Online Platforms

Add code
Jun 07, 2024
Viaarxiv icon

RLIF: Interactive Imitation Learning as Reinforcement Learning

Add code
Nov 21, 2023
Viaarxiv icon

Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning

Add code
Oct 18, 2023
Figure 1 for Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Figure 2 for Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Figure 3 for Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Figure 4 for Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Viaarxiv icon

Near-Optimal High-Probability Convergence for Non-Convex Stochastic Optimization with Variance Reduction

Add code
Feb 13, 2023
Viaarxiv icon