Alert button
Picture for Michael Mahoney

Michael Mahoney

Alert button

Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior

Add code
Bookmark button
Alert button
Jun 01, 2023
Shashank Subramanian, Peter Harrington, Kurt Keutzer, Wahid Bhimji, Dmitriy Morozov, Michael Mahoney, Amir Gholami

Figure 1 for Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior
Figure 2 for Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior
Figure 3 for Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior
Figure 4 for Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior
Viaarxiv icon

GACT: Activation Compressed Training for General Architectures

Add code
Bookmark button
Alert button
Jun 28, 2022
Xiaoxuan Liu, Lianmin Zheng, Dequan Wang, Yukuo Cen, Weize Chen, Xu Han, Jianfei Chen, Zhiyuan Liu, Jie Tang, Joey Gonzalez, Michael Mahoney, Alvin Cheung

Figure 1 for GACT: Activation Compressed Training for General Architectures
Figure 2 for GACT: Activation Compressed Training for General Architectures
Figure 3 for GACT: Activation Compressed Training for General Architectures
Figure 4 for GACT: Activation Compressed Training for General Architectures
Viaarxiv icon

AutoIP: A United Framework to Integrate Physics into Gaussian Processes

Add code
Bookmark button
Alert button
Feb 24, 2022
Da Long, Zheng Wang, Aditi Krishnapriyan, Robert Kirby, Shandian Zhe, Michael Mahoney

Figure 1 for AutoIP: A United Framework to Integrate Physics into Gaussian Processes
Figure 2 for AutoIP: A United Framework to Integrate Physics into Gaussian Processes
Figure 3 for AutoIP: A United Framework to Integrate Physics into Gaussian Processes
Figure 4 for AutoIP: A United Framework to Integrate Physics into Gaussian Processes
Viaarxiv icon

LocalNewton: Reducing Communication Bottleneck for Distributed Learning

Add code
Bookmark button
Alert button
May 16, 2021
Vipul Gupta, Avishek Ghosh, Michal Derezinski, Rajiv Khanna, Kannan Ramchandran, Michael Mahoney

Figure 1 for LocalNewton: Reducing Communication Bottleneck for Distributed Learning
Figure 2 for LocalNewton: Reducing Communication Bottleneck for Distributed Learning
Figure 3 for LocalNewton: Reducing Communication Bottleneck for Distributed Learning
Figure 4 for LocalNewton: Reducing Communication Bottleneck for Distributed Learning
Viaarxiv icon

Rethinking Batch Normalization in Transformers

Add code
Bookmark button
Alert button
Mar 17, 2020
Sheng Shen, Zhewei Yao, Amir Gholami, Michael Mahoney, Kurt Keutzer

Figure 1 for Rethinking Batch Normalization in Transformers
Figure 2 for Rethinking Batch Normalization in Transformers
Figure 3 for Rethinking Batch Normalization in Transformers
Figure 4 for Rethinking Batch Normalization in Transformers
Viaarxiv icon

PyHessian: Neural Networks Through the Lens of the Hessian

Add code
Bookmark button
Alert button
Jan 02, 2020
Zhewei Yao, Amir Gholami, Kurt Keutzer, Michael Mahoney

Figure 1 for PyHessian: Neural Networks Through the Lens of the Hessian
Figure 2 for PyHessian: Neural Networks Through the Lens of the Hessian
Figure 3 for PyHessian: Neural Networks Through the Lens of the Hessian
Figure 4 for PyHessian: Neural Networks Through the Lens of the Hessian
Viaarxiv icon

ANODEV2: A Coupled Neural ODE Evolution Framework

Add code
Bookmark button
Alert button
Jun 10, 2019
Tianjun Zhang, Zhewei Yao, Amir Gholami, Kurt Keutzer, Joseph Gonzalez, George Biros, Michael Mahoney

Figure 1 for ANODEV2: A Coupled Neural ODE Evolution Framework
Figure 2 for ANODEV2: A Coupled Neural ODE Evolution Framework
Figure 3 for ANODEV2: A Coupled Neural ODE Evolution Framework
Figure 4 for ANODEV2: A Coupled Neural ODE Evolution Framework
Viaarxiv icon

HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision

Add code
Bookmark button
Alert button
Apr 29, 2019
Zhen Dong, Zhewei Yao, Amir Gholami, Michael Mahoney, Kurt Keutzer

Figure 1 for HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
Figure 2 for HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
Figure 3 for HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
Figure 4 for HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
Viaarxiv icon