Alert button
Picture for Jason Weston

Jason Weston

Alert button

Reverse Training to Nurse the Reversal Curse

Add code
Bookmark button
Alert button
Mar 20, 2024
Olga Golovneva, Zeyuan Allen-Zhu, Jason Weston, Sainbayar Sukhbaatar

Figure 1 for Reverse Training to Nurse the Reversal Curse
Figure 2 for Reverse Training to Nurse the Reversal Curse
Figure 3 for Reverse Training to Nurse the Reversal Curse
Figure 4 for Reverse Training to Nurse the Reversal Curse
Viaarxiv icon

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Add code
Bookmark button
Alert button
Mar 12, 2024
Sainbayar Sukhbaatar, Olga Golovneva, Vasu Sharma, Hu Xu, Xi Victoria Lin, Baptiste Rozière, Jacob Kahn, Daniel Li, Wen-tau Yih, Jason Weston, Xian Li

Figure 1 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 2 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 3 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 4 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Viaarxiv icon

TOOLVERIFIER: Generalization to New Tools via Self-Verification

Add code
Bookmark button
Alert button
Feb 21, 2024
Dheeraj Mekala, Jason Weston, Jack Lanchantin, Roberta Raileanu, Maria Lomeli, Jingbo Shang, Jane Dwivedi-Yu

Viaarxiv icon

Self-Rewarding Language Models

Add code
Bookmark button
Alert button
Jan 18, 2024
Weizhe Yuan, Richard Yuanzhe Pang, Kyunghyun Cho, Sainbayar Sukhbaatar, Jing Xu, Jason Weston

Viaarxiv icon

Some things are more CRINGE than others: Preference Optimization with the Pairwise Cringe Loss

Add code
Bookmark button
Alert button
Dec 27, 2023
Jing Xu, Andrew Lee, Sainbayar Sukhbaatar, Jason Weston

Viaarxiv icon

System 2 Attention (is something you might need too)

Add code
Bookmark button
Alert button
Nov 20, 2023
Jason Weston, Sainbayar Sukhbaatar

Viaarxiv icon

The ART of LLM Refinement: Ask, Refine, and Trust

Add code
Bookmark button
Alert button
Nov 14, 2023
Kumar Shridhar, Koustuv Sinha, Andrew Cohen, Tianlu Wang, Ping Yu, Ram Pasunuru, Mrinmaya Sachan, Jason Weston, Asli Celikyilmaz

Viaarxiv icon

Branch-Solve-Merge Improves Large Language Model Evaluation and Generation

Add code
Bookmark button
Alert button
Oct 23, 2023
Swarnadeep Saha, Omer Levy, Asli Celikyilmaz, Mohit Bansal, Jason Weston, Xian Li

Viaarxiv icon

Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading

Add code
Bookmark button
Alert button
Oct 08, 2023
Howard Chen, Ramakanth Pasunuru, Jason Weston, Asli Celikyilmaz

Figure 1 for Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
Figure 2 for Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
Figure 3 for Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
Figure 4 for Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
Viaarxiv icon

Chain-of-Verification Reduces Hallucination in Large Language Models

Add code
Bookmark button
Alert button
Sep 25, 2023
Shehzaad Dhuliawala, Mojtaba Komeili, Jing Xu, Roberta Raileanu, Xian Li, Asli Celikyilmaz, Jason Weston

Figure 1 for Chain-of-Verification Reduces Hallucination in Large Language Models
Figure 2 for Chain-of-Verification Reduces Hallucination in Large Language Models
Figure 3 for Chain-of-Verification Reduces Hallucination in Large Language Models
Figure 4 for Chain-of-Verification Reduces Hallucination in Large Language Models
Viaarxiv icon