Picture for Joey Hong

Joey Hong

On the Sensitivity of Reward Inference to Misspecified Human Models

Add code
Dec 09, 2022
Figure 1 for On the Sensitivity of Reward Inference to Misspecified Human Models
Figure 2 for On the Sensitivity of Reward Inference to Misspecified Human Models
Figure 3 for On the Sensitivity of Reward Inference to Misspecified Human Models
Figure 4 for On the Sensitivity of Reward Inference to Misspecified Human Models
Viaarxiv icon

Confidence-Conditioned Value Functions for Offline Reinforcement Learning

Add code
Dec 08, 2022
Viaarxiv icon

When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?

Add code
Apr 12, 2022
Figure 1 for When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Figure 2 for When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Figure 3 for When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Figure 4 for When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Viaarxiv icon

Compositional Generalization and Decomposition in Neural Program Synthesis

Add code
Apr 07, 2022
Figure 1 for Compositional Generalization and Decomposition in Neural Program Synthesis
Figure 2 for Compositional Generalization and Decomposition in Neural Program Synthesis
Figure 3 for Compositional Generalization and Decomposition in Neural Program Synthesis
Figure 4 for Compositional Generalization and Decomposition in Neural Program Synthesis
Viaarxiv icon

Deep Hierarchy in Bandits

Add code
Feb 03, 2022
Figure 1 for Deep Hierarchy in Bandits
Figure 2 for Deep Hierarchy in Bandits
Figure 3 for Deep Hierarchy in Bandits
Figure 4 for Deep Hierarchy in Bandits
Viaarxiv icon

Hierarchical Bayesian Bandits

Add code
Nov 12, 2021
Figure 1 for Hierarchical Bayesian Bandits
Figure 2 for Hierarchical Bayesian Bandits
Figure 3 for Hierarchical Bayesian Bandits
Viaarxiv icon

Thompson Sampling with a Mixture Prior

Add code
Jun 10, 2021
Figure 1 for Thompson Sampling with a Mixture Prior
Figure 2 for Thompson Sampling with a Mixture Prior
Viaarxiv icon

Non-Stationary Latent Bandits

Add code
Dec 01, 2020
Figure 1 for Non-Stationary Latent Bandits
Figure 2 for Non-Stationary Latent Bandits
Figure 3 for Non-Stationary Latent Bandits
Viaarxiv icon

Latent Programmer: Discrete Latent Codes for Program Synthesis

Add code
Dec 01, 2020
Figure 1 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Figure 2 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Figure 3 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Figure 4 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Viaarxiv icon

Latent Bandits Revisited

Add code
Jun 15, 2020
Figure 1 for Latent Bandits Revisited
Figure 2 for Latent Bandits Revisited
Viaarxiv icon