
Usman Anwar

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Apr 15, 2024
Via arXiv

Reward Model Ensembles Help Mitigate Overoptimization

Oct 04, 2023
Via arXiv

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Jul 27, 2023
Via arXiv

Domain Generalization for Robust Model-Based Offline Reinforcement Learning

Nov 27, 2022
Via arXiv

Inverse Constrained Reinforcement Learning

Nov 24, 2020
Via arXiv

Learning To Solve Differential Equations Across Initial Conditions

Apr 19, 2020
Via arXiv