Alert button
Picture for Abhinav Bhatele

Abhinav Bhatele

Alert button

Can Large Language Models Write Parallel Code?

Add code
Bookmark button
Alert button
Jan 23, 2024
Daniel Nichols, Joshua H. Davis, Zhaojun Xie, Arjun Rajaram, Abhinav Bhatele

Viaarxiv icon

Jorge: Approximate Preconditioning for GPU-efficient Second-order Optimization

Add code
Bookmark button
Alert button
Oct 27, 2023
Siddharth Singh, Zachary Sating, Abhinav Bhatele

Viaarxiv icon

Modeling Parallel Programs using Large Language Models

Add code
Bookmark button
Alert button
Jun 29, 2023
Daniel Nichols, Aniruddha Marathe, Harshitha Menon, Todd Gamblin, Abhinav Bhatele

Figure 1 for Modeling Parallel Programs using Large Language Models
Figure 2 for Modeling Parallel Programs using Large Language Models
Figure 3 for Modeling Parallel Programs using Large Language Models
Figure 4 for Modeling Parallel Programs using Large Language Models
Viaarxiv icon

Communication-minimizing Asynchronous Tensor Parallelism

Add code
Bookmark button
Alert button
May 22, 2023
Siddharth Singh, Zack Sating, Abhinav Bhatele

Figure 1 for Communication-minimizing Asynchronous Tensor Parallelism
Figure 2 for Communication-minimizing Asynchronous Tensor Parallelism
Figure 3 for Communication-minimizing Asynchronous Tensor Parallelism
Figure 4 for Communication-minimizing Asynchronous Tensor Parallelism
Viaarxiv icon

A Novel Tensor-Expert Hybrid Parallelism Approach to Scale Mixture-of-Experts Training

Add code
Bookmark button
Alert button
Mar 11, 2023
Siddharth Singh, Olatunji Ruwase, Ammar Ahmad Awan, Samyam Rajbhandari, Yuxiong He, Abhinav Bhatele

Figure 1 for A Novel Tensor-Expert Hybrid Parallelism Approach to Scale Mixture-of-Experts Training
Figure 2 for A Novel Tensor-Expert Hybrid Parallelism Approach to Scale Mixture-of-Experts Training
Figure 3 for A Novel Tensor-Expert Hybrid Parallelism Approach to Scale Mixture-of-Experts Training
Figure 4 for A Novel Tensor-Expert Hybrid Parallelism Approach to Scale Mixture-of-Experts Training
Viaarxiv icon

Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training

Add code
Bookmark button
Alert button
Feb 10, 2023
Siddharth Singh, Abhinav Bhatele

Figure 1 for Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training
Figure 2 for Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training
Figure 3 for Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training
Figure 4 for Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training
Viaarxiv icon

How to Train Your Neural Network: A Comparative Evaluation

Add code
Bookmark button
Alert button
Nov 09, 2021
Shu-Huai Lin, Daniel Nichols, Siddharth Singh, Abhinav Bhatele

Figure 1 for How to Train Your Neural Network: A Comparative Evaluation
Figure 2 for How to Train Your Neural Network: A Comparative Evaluation
Figure 3 for How to Train Your Neural Network: A Comparative Evaluation
Figure 4 for How to Train Your Neural Network: A Comparative Evaluation
Viaarxiv icon

Myelin: An asynchronous, message-driven parallel framework for extreme-scale deep learning

Add code
Bookmark button
Alert button
Oct 26, 2021
Siddharth Singh, Abhinav Bhatele

Figure 1 for Myelin: An asynchronous, message-driven parallel framework for extreme-scale deep learning
Figure 2 for Myelin: An asynchronous, message-driven parallel framework for extreme-scale deep learning
Figure 3 for Myelin: An asynchronous, message-driven parallel framework for extreme-scale deep learning
Figure 4 for Myelin: An asynchronous, message-driven parallel framework for extreme-scale deep learning
Viaarxiv icon

Analytics of Longitudinal System Monitoring Data for Performance Prediction

Add code
Bookmark button
Alert button
Jul 07, 2020
Ian J. Costello, Abhinav Bhatele

Figure 1 for Analytics of Longitudinal System Monitoring Data for Performance Prediction
Figure 2 for Analytics of Longitudinal System Monitoring Data for Performance Prediction
Figure 3 for Analytics of Longitudinal System Monitoring Data for Performance Prediction
Figure 4 for Analytics of Longitudinal System Monitoring Data for Performance Prediction
Viaarxiv icon