Aldo Pacchiano

Supervised Pretraining Can Learn In-Context Reinforcement Learning

Jun 26, 2023

A Unified Model and Dimension for Interactive Estimation

Jun 09, 2023

Data-Driven Regret Balancing for Online Model Selection in Bandits

Jun 05, 2023

Improving Offline RL by Blending Heuristics

Jun 01, 2023

Estimating Optimal Policy Value in General Linear Contextual Bandits

Feb 19, 2023

Transfer RL via the Undo Maps Formalism

Nov 26, 2022

Leveraging Offline Data in Online Reinforcement Learning

Nov 09, 2022

Learning General World Models in a Handful of Reward-Free Deployments

Oct 23, 2022

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity

Oct 18, 2022

Neural Design for Genetic Perturbation Experiments

Jul 26, 2022