Yannick Metz

RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

Aug 08, 2023
Via arXiv

How to Enable Uncertainty Estimation in Proximal Policy Optimization

Oct 07, 2022
Via arXiv

BARReL: Bottleneck Attention for Adversarial Robustness in Vision-Based Reinforcement Learning

Aug 22, 2022
Via arXiv