Alert button
Picture for Mohammad Shoeybi

Mohammad Shoeybi

Alert button

Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases

Add code
Bookmark button
Alert button
Dec 15, 2021
Shrimai Prabhumoye, Rafal Kocielnik, Mohammad Shoeybi, Anima Anandkumar, Bryan Catanzaro

Figure 1 for Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases
Figure 2 for Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases
Figure 3 for Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases
Figure 4 for Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases
Viaarxiv icon

Long-Short Transformer: Efficient Transformers for Language and Vision

Add code
Bookmark button
Alert button
Jul 27, 2021
Chen Zhu, Wei Ping, Chaowei Xiao, Mohammad Shoeybi, Tom Goldstein, Anima Anandkumar, Bryan Catanzaro

Figure 1 for Long-Short Transformer: Efficient Transformers for Language and Vision
Figure 2 for Long-Short Transformer: Efficient Transformers for Language and Vision
Figure 3 for Long-Short Transformer: Efficient Transformers for Language and Vision
Figure 4 for Long-Short Transformer: Efficient Transformers for Language and Vision
Viaarxiv icon

Efficient Large-Scale Language Model Training on GPU Clusters

Add code
Bookmark button
Alert button
Apr 09, 2021
Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia

Figure 1 for Efficient Large-Scale Language Model Training on GPU Clusters
Figure 2 for Efficient Large-Scale Language Model Training on GPU Clusters
Figure 3 for Efficient Large-Scale Language Model Training on GPU Clusters
Figure 4 for Efficient Large-Scale Language Model Training on GPU Clusters
Viaarxiv icon

End-to-End Training of Neural Retrievers for Open-Domain Question Answering

Add code
Bookmark button
Alert button
Jan 02, 2021
Devendra Singh Sachan, Mostofa Patwary, Mohammad Shoeybi, Neel Kant, Wei Ping, William L Hamilton, Bryan Catanzaro

Figure 1 for End-to-End Training of Neural Retrievers for Open-Domain Question Answering
Figure 2 for End-to-End Training of Neural Retrievers for Open-Domain Question Answering
Figure 3 for End-to-End Training of Neural Retrievers for Open-Domain Question Answering
Figure 4 for End-to-End Training of Neural Retrievers for Open-Domain Question Answering
Viaarxiv icon

Local Knowledge Powered Conversational Agents

Add code
Bookmark button
Alert button
Oct 20, 2020
Sashank Santhanam, Wei Ping, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro

Figure 1 for Local Knowledge Powered Conversational Agents
Figure 2 for Local Knowledge Powered Conversational Agents
Figure 3 for Local Knowledge Powered Conversational Agents
Figure 4 for Local Knowledge Powered Conversational Agents
Viaarxiv icon

BioMegatron: Larger Biomedical Domain Language Model

Add code
Bookmark button
Alert button
Oct 14, 2020
Hoo-Chang Shin, Yang Zhang, Evelina Bakhturina, Raul Puri, Mostofa Patwary, Mohammad Shoeybi, Raghav Mani

Figure 1 for BioMegatron: Larger Biomedical Domain Language Model
Figure 2 for BioMegatron: Larger Biomedical Domain Language Model
Figure 3 for BioMegatron: Larger Biomedical Domain Language Model
Figure 4 for BioMegatron: Larger Biomedical Domain Language Model
Viaarxiv icon

MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models

Add code
Bookmark button
Alert button
Oct 02, 2020
Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Raul Puri, Pascale Fung, Anima Anandkumar, Bryan Catanzaro

Figure 1 for MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models
Figure 2 for MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models
Figure 3 for MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models
Figure 4 for MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models
Viaarxiv icon

Large Scale Multi-Actor Generative Dialog Modeling

Add code
Bookmark button
Alert button
May 13, 2020
Alex Boyd, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro

Figure 1 for Large Scale Multi-Actor Generative Dialog Modeling
Figure 2 for Large Scale Multi-Actor Generative Dialog Modeling
Figure 3 for Large Scale Multi-Actor Generative Dialog Modeling
Figure 4 for Large Scale Multi-Actor Generative Dialog Modeling
Viaarxiv icon