Abhinav Venigalla

BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Mar 27, 2024
Elliot Bolton, Abhinav Venigalla, Michihiro Yasunaga, David Hall, Betty Xiong, Tony Lee, Roxana Daneshjou, Jonathan Frankle, Percy Liang, Michael Carbin, Christopher D. Manning

MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining

Jan 16, 2024
Jacob Portes, Alex Trott, Sam Havens, Daniel King, Abhinav Venigalla, Moin Nadeem, Nikhil Sardana, Daya Khudia, Jonathan Frankle

Representation range needs for 16-bit neural network training

Apr 06, 2021
Valentina Popescu, Abhinav Venigalla, Di Wu, Robert Schreiber

Adaptive Braking for Mitigating Gradient Delay

Jul 10, 2020
Abhinav Venigalla, Atli Kosson, Vitaliy Chiley, Urs Köster

Pipelined Backpropagation at Scale: Training Large Models without Batches

Mar 25, 2020
Atli Kosson, Vitaliy Chiley, Abhinav Venigalla, Joel Hestness, Urs Köster