Picture for Stephanie Milani

Stephanie Milani

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

Add code
Apr 28, 2022
Figure 1 for Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Figure 2 for Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Figure 3 for Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Figure 4 for Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Viaarxiv icon

Retrospective on the 2021 BASALT Competition on Learning from Human Feedback

Add code
Apr 14, 2022
Figure 1 for Retrospective on the 2021 BASALT Competition on Learning from Human Feedback
Figure 2 for Retrospective on the 2021 BASALT Competition on Learning from Human Feedback
Figure 3 for Retrospective on the 2021 BASALT Competition on Learning from Human Feedback
Figure 4 for Retrospective on the 2021 BASALT Competition on Learning from Human Feedback
Viaarxiv icon

MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned

Add code
Feb 17, 2022
Figure 1 for MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned
Figure 2 for MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned
Figure 3 for MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned
Figure 4 for MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned
Viaarxiv icon

A Survey of Explainable Reinforcement Learning

Add code
Feb 17, 2022
Figure 1 for A Survey of Explainable Reinforcement Learning
Figure 2 for A Survey of Explainable Reinforcement Learning
Figure 3 for A Survey of Explainable Reinforcement Learning
Figure 4 for A Survey of Explainable Reinforcement Learning
Viaarxiv icon

The MineRL BASALT Competition on Learning from Human Feedback

Add code
Jul 05, 2021
Figure 1 for The MineRL BASALT Competition on Learning from Human Feedback
Figure 2 for The MineRL BASALT Competition on Learning from Human Feedback
Viaarxiv icon

Towards robust and domain agnostic reinforcement learning competitions

Add code
Jun 07, 2021
Figure 1 for Towards robust and domain agnostic reinforcement learning competitions
Figure 2 for Towards robust and domain agnostic reinforcement learning competitions
Figure 3 for Towards robust and domain agnostic reinforcement learning competitions
Figure 4 for Towards robust and domain agnostic reinforcement learning competitions
Viaarxiv icon

Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods

Add code
Feb 25, 2021
Figure 1 for Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods
Figure 2 for Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods
Figure 3 for Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods
Figure 4 for Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods
Viaarxiv icon

The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors

Add code
Jan 26, 2021
Figure 1 for The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors
Figure 2 for The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors
Figure 3 for The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors
Figure 4 for The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors
Viaarxiv icon

Guaranteeing Reproducibility in Deep Learning Competitions

Add code
May 12, 2020
Viaarxiv icon

Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning

Add code
Mar 27, 2020
Figure 1 for Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning
Figure 2 for Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning
Figure 3 for Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning
Figure 4 for Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning
Viaarxiv icon