Picture for Keith Ross

Keith Ross

Cross Entropy versus Label Smoothing: A Neural Collapse Perspective

Add code
Feb 07, 2024
Viaarxiv icon

Pre-training with Synthetic Data Helps Offline Reinforcement Learning

Add code
Oct 06, 2023
Figure 1 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Figure 2 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Figure 3 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Figure 4 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Viaarxiv icon

On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs

Add code
Sep 07, 2022
Figure 1 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 2 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 3 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 4 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Viaarxiv icon

VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning

Add code
Feb 17, 2022
Figure 1 for VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
Figure 2 for VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
Figure 3 for VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
Figure 4 for VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
Viaarxiv icon

Randomized Ensembled Double Q-Learning: Learning Fast Without a Model

Add code
Jan 15, 2021
Figure 1 for Randomized Ensembled Double Q-Learning: Learning Fast Without a Model
Figure 2 for Randomized Ensembled Double Q-Learning: Learning Fast Without a Model
Figure 3 for Randomized Ensembled Double Q-Learning: Learning Fast Without a Model
Figure 4 for Randomized Ensembled Double Q-Learning: Learning Fast Without a Model
Viaarxiv icon

On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning

Add code
Feb 10, 2020
Figure 1 for On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning
Viaarxiv icon

BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning

Add code
Oct 27, 2019
Figure 1 for BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Figure 2 for BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Figure 3 for BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Figure 4 for BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Viaarxiv icon

Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning

Add code
Oct 10, 2019
Figure 1 for Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning
Figure 2 for Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning
Figure 3 for Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning
Figure 4 for Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning
Viaarxiv icon

Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past

Add code
Jun 10, 2019
Figure 1 for Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past
Figure 2 for Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past
Figure 3 for Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past
Figure 4 for Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past
Viaarxiv icon