Jakob Foerster

Mixture of Experts in a Mixture of RL settings

Jun 26, 2024

Behaviour Distillation

Jun 21, 2024

Discovering Preference Optimization Algorithms with and for Large Language Models

Jun 12, 2024

Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning

Jun 01, 2024

Risks and Opportunities of Open-Source Generative AI

May 14, 2024

PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition

May 14, 2024

Select to Perfect: Imitating desired behavior from large multi-agent data

May 06, 2024

Near to Mid-term Risks and Opportunities of Open Source Generative AI

Apr 25, 2024

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Apr 15, 2024

Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection

Apr 10, 2024