Picture for Dibya Ghosh

Dibya Ghosh

Octo: An Open-Source Generalist Robot Policy

Add code
May 20, 2024
Viaarxiv icon

Accelerating Exploration with Unlabeled Prior Data

Add code
Nov 21, 2023
Figure 1 for Accelerating Exploration with Unlabeled Prior Data
Figure 2 for Accelerating Exploration with Unlabeled Prior Data
Figure 3 for Accelerating Exploration with Unlabeled Prior Data
Figure 4 for Accelerating Exploration with Unlabeled Prior Data
Viaarxiv icon

Robotic Offline RL from Internet Videos via Value-Function Pre-Training

Add code
Sep 22, 2023
Figure 1 for Robotic Offline RL from Internet Videos via Value-Function Pre-Training
Figure 2 for Robotic Offline RL from Internet Videos via Value-Function Pre-Training
Figure 3 for Robotic Offline RL from Internet Videos via Value-Function Pre-Training
Figure 4 for Robotic Offline RL from Internet Videos via Value-Function Pre-Training
Viaarxiv icon

HIQL: Offline Goal-Conditioned RL with Latent States as Actions

Add code
Jul 22, 2023
Figure 1 for HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Figure 2 for HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Figure 3 for HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Figure 4 for HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Viaarxiv icon

Reinforcement Learning from Passive Data via Latent Intentions

Add code
Apr 10, 2023
Figure 1 for Reinforcement Learning from Passive Data via Latent Intentions
Figure 2 for Reinforcement Learning from Passive Data via Latent Intentions
Figure 3 for Reinforcement Learning from Passive Data via Latent Intentions
Figure 4 for Reinforcement Learning from Passive Data via Latent Intentions
Viaarxiv icon

Distributionally Adaptive Meta Reinforcement Learning

Add code
Oct 06, 2022
Figure 1 for Distributionally Adaptive Meta Reinforcement Learning
Figure 2 for Distributionally Adaptive Meta Reinforcement Learning
Figure 3 for Distributionally Adaptive Meta Reinforcement Learning
Figure 4 for Distributionally Adaptive Meta Reinforcement Learning
Viaarxiv icon

Offline RL Policies Should be Trained to be Adaptive

Add code
Jul 05, 2022
Figure 1 for Offline RL Policies Should be Trained to be Adaptive
Figure 2 for Offline RL Policies Should be Trained to be Adaptive
Figure 3 for Offline RL Policies Should be Trained to be Adaptive
Figure 4 for Offline RL Policies Should be Trained to be Adaptive
Viaarxiv icon

Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability

Add code
Jul 13, 2021
Figure 1 for Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Figure 2 for Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Figure 3 for Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Figure 4 for Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Viaarxiv icon

Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning

Add code
Oct 27, 2020
Figure 1 for Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
Figure 2 for Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
Figure 3 for Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
Figure 4 for Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
Viaarxiv icon

Representations for Stable Off-Policy Reinforcement Learning

Add code
Jul 10, 2020
Figure 1 for Representations for Stable Off-Policy Reinforcement Learning
Figure 2 for Representations for Stable Off-Policy Reinforcement Learning
Figure 3 for Representations for Stable Off-Policy Reinforcement Learning
Figure 4 for Representations for Stable Off-Policy Reinforcement Learning
Viaarxiv icon