Alert button
Picture for Olga Golovneva

Olga Golovneva

Alert button

Reverse Training to Nurse the Reversal Curse

Add code
Bookmark button
Alert button
Mar 20, 2024
Olga Golovneva, Zeyuan Allen-Zhu, Jason Weston, Sainbayar Sukhbaatar

Figure 1 for Reverse Training to Nurse the Reversal Curse
Figure 2 for Reverse Training to Nurse the Reversal Curse
Figure 3 for Reverse Training to Nurse the Reversal Curse
Figure 4 for Reverse Training to Nurse the Reversal Curse
Viaarxiv icon

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Add code
Bookmark button
Alert button
Mar 12, 2024
Sainbayar Sukhbaatar, Olga Golovneva, Vasu Sharma, Hu Xu, Xi Victoria Lin, Baptiste Rozière, Jacob Kahn, Daniel Li, Wen-tau Yih, Jason Weston, Xian Li

Figure 1 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 2 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 3 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 4 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Viaarxiv icon

Efficient Tool Use with Chain-of-Abstraction Reasoning

Add code
Bookmark button
Alert button
Jan 30, 2024
Silin Gao, Jane Dwivedi-Yu, Ping Yu, Xiaoqing Ellen Tan, Ramakanth Pasunuru, Olga Golovneva, Koustuv Sinha, Asli Celikyilmaz, Antoine Bosselut, Tianlu Wang

Viaarxiv icon

PathFinder: Guided Search over Multi-Step Reasoning Paths

Add code
Bookmark button
Alert button
Dec 12, 2023
Olga Golovneva, Sean O'Brien, Ramakanth Pasunuru, Tianlu Wang, Luke Zettlemoyer, Maryam Fazel-Zarandi, Asli Celikyilmaz

Figure 1 for PathFinder: Guided Search over Multi-Step Reasoning Paths
Figure 2 for PathFinder: Guided Search over Multi-Step Reasoning Paths
Figure 3 for PathFinder: Guided Search over Multi-Step Reasoning Paths
Figure 4 for PathFinder: Guided Search over Multi-Step Reasoning Paths
Viaarxiv icon

DOMINO: A Dual-System for Multi-step Visual Language Reasoning

Add code
Bookmark button
Alert button
Oct 04, 2023
Peifang Wang, Olga Golovneva, Armen Aghajanyan, Xiang Ren, Muhao Chen, Asli Celikyilmaz, Maryam Fazel-Zarandi

Figure 1 for DOMINO: A Dual-System for Multi-step Visual Language Reasoning
Figure 2 for DOMINO: A Dual-System for Multi-step Visual Language Reasoning
Figure 3 for DOMINO: A Dual-System for Multi-step Visual Language Reasoning
Figure 4 for DOMINO: A Dual-System for Multi-step Visual Language Reasoning
Viaarxiv icon

Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

Add code
Bookmark button
Alert button
Sep 05, 2023
Lili Yu, Bowen Shi, Ramakanth Pasunuru, Benjamin Muller, Olga Golovneva, Tianlu Wang, Arun Babu, Binh Tang, Brian Karrer, Shelly Sheynin, Candace Ross, Adam Polyak, Russell Howes, Vasu Sharma, Puxin Xu, Hovhannes Tamoyan, Oron Ashual, Uriel Singer, Shang-Wen Li, Susan Zhang, Richard James, Gargi Ghosh, Yaniv Taigman, Maryam Fazel-Zarandi, Asli Celikyilmaz, Luke Zettlemoyer, Armen Aghajanyan

Figure 1 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Figure 2 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Figure 3 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Figure 4 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Viaarxiv icon

Shepherd: A Critic for Language Model Generation

Add code
Bookmark button
Alert button
Aug 08, 2023
Tianlu Wang, Ping Yu, Xiaoqing Ellen Tan, Sean O'Brien, Ramakanth Pasunuru, Jane Dwivedi-Yu, Olga Golovneva, Luke Zettlemoyer, Maryam Fazel-Zarandi, Asli Celikyilmaz

Figure 1 for Shepherd: A Critic for Language Model Generation
Figure 2 for Shepherd: A Critic for Language Model Generation
Figure 3 for Shepherd: A Critic for Language Model Generation
Figure 4 for Shepherd: A Critic for Language Model Generation
Viaarxiv icon

ALERT: Adapting Language Models to Reasoning Tasks

Add code
Bookmark button
Alert button
Dec 16, 2022
Ping Yu, Tianlu Wang, Olga Golovneva, Badr Alkhamissy, Gargi Ghosh, Mona Diab, Asli Celikyilmaz

Figure 1 for ALERT: Adapting Language Models to Reasoning Tasks
Figure 2 for ALERT: Adapting Language Models to Reasoning Tasks
Figure 3 for ALERT: Adapting Language Models to Reasoning Tasks
Figure 4 for ALERT: Adapting Language Models to Reasoning Tasks
Viaarxiv icon

ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning

Add code
Bookmark button
Alert button
Dec 15, 2022
Olga Golovneva, Moya Chen, Spencer Poff, Martin Corredor, Luke Zettlemoyer, Maryam Fazel-Zarandi, Asli Celikyilmaz

Figure 1 for ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning
Figure 2 for ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning
Figure 3 for ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning
Figure 4 for ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning
Viaarxiv icon

Generative Adversarial Networks for Annotated Data Augmentation in Data Sparse NLU

Add code
Bookmark button
Alert button
Dec 09, 2020
Olga Golovneva, Charith Peris

Figure 1 for Generative Adversarial Networks for Annotated Data Augmentation in Data Sparse NLU
Figure 2 for Generative Adversarial Networks for Annotated Data Augmentation in Data Sparse NLU
Figure 3 for Generative Adversarial Networks for Annotated Data Augmentation in Data Sparse NLU
Figure 4 for Generative Adversarial Networks for Annotated Data Augmentation in Data Sparse NLU
Viaarxiv icon