Osbert Bastani

Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option Templates

Feb 25, 2022

Understanding Robust Generalization in Learning Regular Languages

Feb 20, 2022

SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching

Feb 04, 2022

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

Dec 14, 2021

Safely Bridging Offline and Online Reinforcement Learning

Oct 25, 2021

Safe Human-Interactive Control via Shielding

Oct 11, 2021

Synthesizing Machine Learning Programs with PAC Guarantees via Statistical Sketching

Oct 11, 2021

Robust Generalization of Quadratic Neural Networks via Function Identification

Sep 22, 2021

Improving Human Decision-Making with Machine Learning

Aug 31, 2021

Conservative Offline Distributional Reinforcement Learning

Jul 12, 2021