Alert button
Picture for Supratik Paul

Supratik Paul

Alert button

Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula

Add code
Bookmark button
Alert button
Dec 02, 2022
Eli Bronstein, Sirish Srinivasan, Supratik Paul, Aman Sinha, Matthew O'Kelly, Payam Nikdel, Shimon Whiteson

Figure 1 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Figure 2 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Figure 3 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Figure 4 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Viaarxiv icon

Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving

Add code
Bookmark button
Alert button
Oct 18, 2022
Eli Bronstein, Mark Palatucci, Dominik Notz, Brandyn White, Alex Kuefler, Yiren Lu, Supratik Paul, Payam Nikdel, Paul Mougin, Hongge Chen, Justin Fu, Austin Abrams, Punit Shah, Evan Racah, Benjamin Frenkel, Shimon Whiteson, Dragomir Anguelov

Figure 1 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Figure 2 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Figure 3 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Figure 4 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Viaarxiv icon

Fast Efficient Hyperparameter Tuning for Policy Gradients

Add code
Bookmark button
Alert button
Feb 18, 2019
Supratik Paul, Vitaly Kurin, Shimon Whiteson

Figure 1 for Fast Efficient Hyperparameter Tuning for Policy Gradients
Figure 2 for Fast Efficient Hyperparameter Tuning for Policy Gradients
Figure 3 for Fast Efficient Hyperparameter Tuning for Policy Gradients
Figure 4 for Fast Efficient Hyperparameter Tuning for Policy Gradients
Viaarxiv icon

Learning from Demonstration in the Wild

Add code
Bookmark button
Alert button
Nov 08, 2018
Feryal Behbahani, Kyriacos Shiarlis, Xi Chen, Vitaly Kurin, Sudhanshu Kasewa, Ciprian Stirbu, João Gomes, Supratik Paul, Frans A. Oliehoek, João Messias, Shimon Whiteson

Figure 1 for Learning from Demonstration in the Wild
Figure 2 for Learning from Demonstration in the Wild
Figure 3 for Learning from Demonstration in the Wild
Figure 4 for Learning from Demonstration in the Wild
Viaarxiv icon

Fingerprint Policy Optimisation for Robust Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 15, 2018
Supratik Paul, Michael A. Osborne, Shimon Whiteson

Figure 1 for Fingerprint Policy Optimisation for Robust Reinforcement Learning
Figure 2 for Fingerprint Policy Optimisation for Robust Reinforcement Learning
Figure 3 for Fingerprint Policy Optimisation for Robust Reinforcement Learning
Figure 4 for Fingerprint Policy Optimisation for Robust Reinforcement Learning
Viaarxiv icon

Alternating Optimisation and Quadrature for Robust Control

Add code
Bookmark button
Alert button
Dec 18, 2017
Supratik Paul, Konstantinos Chatzilygeroudis, Kamil Ciosek, Jean-Baptiste Mouret, Michael A. Osborne, Shimon Whiteson

Figure 1 for Alternating Optimisation and Quadrature for Robust Control
Figure 2 for Alternating Optimisation and Quadrature for Robust Control
Figure 3 for Alternating Optimisation and Quadrature for Robust Control
Figure 4 for Alternating Optimisation and Quadrature for Robust Control
Viaarxiv icon