Alert button
Picture for Osbert Bastani

Osbert Bastani

Alert button

Specification-Guided Learning of Nash Equilibria with High Social Welfare

Jun 06, 2022
Kishor Jothimurugan, Suguman Bansal, Osbert Bastani, Rajeev Alur

Figure 1 for Specification-Guided Learning of Nash Equilibria with High Social Welfare
Figure 2 for Specification-Guided Learning of Nash Equilibria with High Social Welfare
Figure 3 for Specification-Guided Learning of Nash Equilibria with High Social Welfare
Viaarxiv icon

Practical Adversarial Multivalid Conformal Prediction

Jun 02, 2022
Osbert Bastani, Varun Gupta, Christopher Jung, Georgy Noarov, Ramya Ramalingam, Aaron Roth

Figure 1 for Practical Adversarial Multivalid Conformal Prediction
Figure 2 for Practical Adversarial Multivalid Conformal Prediction
Figure 3 for Practical Adversarial Multivalid Conformal Prediction
Figure 4 for Practical Adversarial Multivalid Conformal Prediction
Viaarxiv icon

Counterfactual Explanations for Natural Language Interfaces

Apr 27, 2022
George Tolkachev, Stephen Mell, Steve Zdancewic, Osbert Bastani

Figure 1 for Counterfactual Explanations for Natural Language Interfaces
Figure 2 for Counterfactual Explanations for Natural Language Interfaces
Viaarxiv icon

Towards PAC Multi-Object Detection and Tracking

Apr 15, 2022
Shuo Li, Sangdon Park, Xiayan Ji, Insup Lee, Osbert Bastani

Figure 1 for Towards PAC Multi-Object Detection and Tracking
Figure 2 for Towards PAC Multi-Object Detection and Tracking
Figure 3 for Towards PAC Multi-Object Detection and Tracking
Figure 4 for Towards PAC Multi-Object Detection and Tracking
Viaarxiv icon

Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option Templates

Feb 25, 2022
Souradeep Dutta, Kaustubh Sridhar, Osbert Bastani, Edgar Dobriban, James Weimer, Insup Lee, Julia Parish-Morris

Figure 1 for Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option Templates
Figure 2 for Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option Templates
Figure 3 for Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option Templates
Figure 4 for Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option Templates
Viaarxiv icon

Understanding Robust Generalization in Learning Regular Languages

Feb 20, 2022
Soham Dan, Osbert Bastani, Dan Roth

Figure 1 for Understanding Robust Generalization in Learning Regular Languages
Figure 2 for Understanding Robust Generalization in Learning Regular Languages
Figure 3 for Understanding Robust Generalization in Learning Regular Languages
Figure 4 for Understanding Robust Generalization in Learning Regular Languages
Viaarxiv icon

SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching

Feb 04, 2022
Yecheng Jason Ma, Andrew Shen, Dinesh Jayaraman, Osbert Bastani

Figure 1 for SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching
Figure 2 for SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching
Figure 3 for SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching
Figure 4 for SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching
Viaarxiv icon

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

Dec 14, 2021
Yecheng Jason Ma, Andrew Shen, Osbert Bastani, Dinesh Jayaraman

Figure 1 for Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
Figure 2 for Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
Figure 3 for Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
Figure 4 for Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
Viaarxiv icon

Safely Bridging Offline and Online Reinforcement Learning

Oct 25, 2021
Wanqiao Xu, Kan Xu, Hamsa Bastani, Osbert Bastani

Figure 1 for Safely Bridging Offline and Online Reinforcement Learning
Viaarxiv icon

Safe Human-Interactive Control via Shielding

Oct 11, 2021
Jeevana Priya Inala, Yecheng Jason Ma, Osbert Bastani, Xin Zhang, Armando Solar-Lezama

Figure 1 for Safe Human-Interactive Control via Shielding
Figure 2 for Safe Human-Interactive Control via Shielding
Figure 3 for Safe Human-Interactive Control via Shielding
Figure 4 for Safe Human-Interactive Control via Shielding
Viaarxiv icon