Roberta Raileanu

Teaching Large Language Models to Reason with Reinforcement Learning

Mar 07, 2024
Alex Havrilla, Yuqing Du, Sharath Chandra Raparthy, Christoforos Nalmpantis, Jane Dwivedi-Yu, Maksym Zhuravinskyi, Eric Hambro, Sainbayar Sukhbaatar, Roberta Raileanu

Via arXiv

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Feb 26, 2024
Mikayel Samvelyan, Sharath Chandra Raparthy, Andrei Lupu, Eric Hambro, Aram H. Markosyan, Manish Bhatt, Yuning Mao, Minqi Jiang, Jack Parker-Holder, Jakob Foerster, Tim Rocktäschel, Roberta Raileanu

Via arXiv

TOOLVERIFIER: Generalization to New Tools via Self-Verification

Feb 21, 2024
Dheeraj Mekala, Jason Weston, Jack Lanchantin, Roberta Raileanu, Maria Lomeli, Jingbo Shang, Jane Dwivedi-Yu

Via arXiv

The Generalization Gap in Offline Reinforcement Learning

Dec 10, 2023
Ishita Mediratta, Qingfei You, Minqi Jiang, Roberta Raileanu

Via arXiv

Generalization to New Sequential Decision Making Tasks with In-Context Learning

Dec 06, 2023
Sharath Chandra Raparthy, Eric Hambro, Robert Kirk, Mikael Henaff, Roberta Raileanu

Via arXiv

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Oct 10, 2023
Robert Kirk, Ishita Mediratta, Christoforos Nalmpantis, Jelena Luketina, Eric Hambro, Edward Grefenstette, Roberta Raileanu

Via arXiv

Motif: Intrinsic Motivation from Artificial Intelligence Feedback

Sep 29, 2023
Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff

Via arXiv

Chain-of-Verification Reduces Hallucination in Large Language Models

Sep 25, 2023
Shehzaad Dhuliawala, Mojtaba Komeili, Jing Xu, Roberta Raileanu, Xian Li, Asli Celikyilmaz, Jason Weston

Via arXiv

Challenges and Applications of Large Language Models

Jul 19, 2023
Jean Kaddour, Joshua Harris, Maximilian Mozes, Herbie Bradley, Roberta Raileanu, Robert McHardy

Via arXiv