Roberta Raileanu

DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft

Apr 23, 2024

Teaching Large Language Models to Reason with Reinforcement Learning

Mar 07, 2024

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Feb 26, 2024

TOOLVERIFIER: Generalization to New Tools via Self-Verification

Feb 21, 2024

The Generalization Gap in Offline Reinforcement Learning

Dec 10, 2023

Generalization to New Sequential Decision Making Tasks with In-Context Learning

Dec 06, 2023

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Oct 10, 2023

Motif: Intrinsic Motivation from Artificial Intelligence Feedback

Sep 29, 2023

Chain-of-Verification Reduces Hallucination in Large Language Models

Sep 25, 2023

Challenges and Applications of Large Language Models

Jul 19, 2023