Picture for Kamal Ndousse

Kamal Ndousse

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Add code
Apr 12, 2022
Figure 1 for Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Figure 2 for Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Figure 3 for Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Figure 4 for Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Viaarxiv icon

A General Language Assistant as a Laboratory for Alignment

Add code
Dec 09, 2021
Figure 1 for A General Language Assistant as a Laboratory for Alignment
Figure 2 for A General Language Assistant as a Laboratory for Alignment
Figure 3 for A General Language Assistant as a Laboratory for Alignment
Figure 4 for A General Language Assistant as a Laboratory for Alignment
Viaarxiv icon

Multi-agent Social Reinforcement Learning Improves Generalization

Add code
Oct 01, 2020
Figure 1 for Multi-agent Social Reinforcement Learning Improves Generalization
Figure 2 for Multi-agent Social Reinforcement Learning Improves Generalization
Figure 3 for Multi-agent Social Reinforcement Learning Improves Generalization
Figure 4 for Multi-agent Social Reinforcement Learning Improves Generalization
Viaarxiv icon