Alert button
Picture for Raul Puri

Raul Puri

Alert button

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

Add code
Bookmark button
Alert button
Oct 05, 2019
Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper, Bryan Catanzaro

Figure 1 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Figure 2 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Figure 3 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Figure 4 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Viaarxiv icon

Practical Text Classification With Large Pre-Trained Language Models

Add code
Bookmark button
Alert button
Dec 04, 2018
Neel Kant, Raul Puri, Nikolai Yakovenko, Bryan Catanzaro

Figure 1 for Practical Text Classification With Large Pre-Trained Language Models
Figure 2 for Practical Text Classification With Large Pre-Trained Language Models
Figure 3 for Practical Text Classification With Large Pre-Trained Language Models
Figure 4 for Practical Text Classification With Large Pre-Trained Language Models
Viaarxiv icon

Large Scale Language Modeling: Converging on 40GB of Text in Four Hours

Add code
Bookmark button
Alert button
Aug 11, 2018
Raul Puri, Robert Kirby, Nikolai Yakovenko, Bryan Catanzaro

Figure 1 for Large Scale Language Modeling: Converging on 40GB of Text in Four Hours
Figure 2 for Large Scale Language Modeling: Converging on 40GB of Text in Four Hours
Figure 3 for Large Scale Language Modeling: Converging on 40GB of Text in Four Hours
Figure 4 for Large Scale Language Modeling: Converging on 40GB of Text in Four Hours
Viaarxiv icon