Shrimai Prabhumoye

Nemotron-4 15B Technical Report

Feb 27, 2024
Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Mostofa Patwary, Sandeep Subramanian, Dan Su, Chen Zhu, Deepak Narayanan, Aastha Jhunjhunwala, Ayush Dattagupta, Vibhu Jawa, Jiwei Liu, Ameya Mahabaleshwarkar, Osvald Nitski, Annika Brundyn, James Maki, Miguel Martinez, Jiaxuan You, John Kamalu, Patrick LeGresley, Denys Fridman, Jared Casper, Ashwath Aithal, Oleksii Kuchaiev, Mohammad Shoeybi, Jonathan Cohen, Bryan Catanzaro

SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning

May 24, 2023
Yue Wu, So Yeon Min, Shrimai Prabhumoye, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Tom Mitchell, Yuanzhi Li

Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents

May 07, 2023
Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye

Self-Refine: Iterative Refinement with Self-Feedback

Mar 30, 2023
Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Sean Welleck, Bodhisattwa Prasad Majumder, Shashank Gupta, Amir Yazdanbakhsh, Peter Clark

Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models

Feb 14, 2023
Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro

AutoBiasTest: Controllable Sentence Generation for Automated and Open-Ended Social Bias Testing in Language Models

Feb 14, 2023
Rafal Kocielnik, Shrimai Prabhumoye, Vivian Zhang, R. Michael Alvarez, Anima Anandkumar

Can You Label Less by Using Out-of-Domain Data? Active & Transfer Learning with Few-shot Instructions

Nov 21, 2022
Rafal Kocielnik, Sara Kangaslahti, Shrimai Prabhumoye, Meena Hari, R. Michael Alvarez, Anima Anandkumar

Evaluating Parameter Efficient Learning for Generation

Oct 25, 2022
Peng Xu, Mostofa Patwary, Shrimai Prabhumoye, Virginia Adams, Ryan J. Prenger, Wei Ping, Nayeon Lee, Mohammad Shoeybi, Bryan Catanzaro

Context Generation Improves Open Domain Question Answering

Oct 12, 2022
Dan Su, Mostofa Patwary, Shrimai Prabhumoye, Peng Xu, Ryan Prenger, Mohammad Shoeybi, Pascale Fung, Anima Anandkumar, Bryan Catanzaro
