Picture for Michael W. Mahoney

Michael W. Mahoney

UC Berkeley/LBNL/ICSI

Hessian Eigenspectra of More Realistic Nonlinear Models

Add code
Mar 17, 2021
Figure 1 for Hessian Eigenspectra of More Realistic Nonlinear Models
Figure 2 for Hessian Eigenspectra of More Realistic Nonlinear Models
Figure 3 for Hessian Eigenspectra of More Realistic Nonlinear Models
Figure 4 for Hessian Eigenspectra of More Realistic Nonlinear Models
Viaarxiv icon

A Differential Geometry Perspective on Orthogonal Recurrent Models

Add code
Feb 18, 2021
Figure 1 for A Differential Geometry Perspective on Orthogonal Recurrent Models
Figure 2 for A Differential Geometry Perspective on Orthogonal Recurrent Models
Figure 3 for A Differential Geometry Perspective on Orthogonal Recurrent Models
Figure 4 for A Differential Geometry Perspective on Orthogonal Recurrent Models
Viaarxiv icon

I-BERT: Integer-only BERT Quantization

Add code
Feb 11, 2021
Figure 1 for I-BERT: Integer-only BERT Quantization
Figure 2 for I-BERT: Integer-only BERT Quantization
Figure 3 for I-BERT: Integer-only BERT Quantization
Figure 4 for I-BERT: Integer-only BERT Quantization
Viaarxiv icon

Noisy Recurrent Neural Networks

Add code
Feb 09, 2021
Figure 1 for Noisy Recurrent Neural Networks
Figure 2 for Noisy Recurrent Neural Networks
Figure 3 for Noisy Recurrent Neural Networks
Figure 4 for Noisy Recurrent Neural Networks
Viaarxiv icon

Sparse sketches with small inversion bias

Add code
Nov 21, 2020
Viaarxiv icon

HAWQV3: Dyadic Neural Network Quantization

Add code
Nov 20, 2020
Figure 1 for HAWQV3: Dyadic Neural Network Quantization
Figure 2 for HAWQV3: Dyadic Neural Network Quantization
Figure 3 for HAWQV3: Dyadic Neural Network Quantization
Figure 4 for HAWQV3: Dyadic Neural Network Quantization
Viaarxiv icon

A Statistical Framework for Low-bitwidth Training of Deep Neural Networks

Add code
Oct 27, 2020
Figure 1 for A Statistical Framework for Low-bitwidth Training of Deep Neural Networks
Figure 2 for A Statistical Framework for Low-bitwidth Training of Deep Neural Networks
Figure 3 for A Statistical Framework for Low-bitwidth Training of Deep Neural Networks
Figure 4 for A Statistical Framework for Low-bitwidth Training of Deep Neural Networks
Viaarxiv icon

Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism

Add code
Oct 18, 2020
Figure 1 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Figure 2 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Figure 3 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Figure 4 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Viaarxiv icon

MAF: Multimodal Alignment Framework for Weakly-Supervised Phrase Grounding

Add code
Oct 12, 2020
Figure 1 for MAF: Multimodal Alignment Framework for Weakly-Supervised Phrase Grounding
Figure 2 for MAF: Multimodal Alignment Framework for Weakly-Supervised Phrase Grounding
Figure 3 for MAF: Multimodal Alignment Framework for Weakly-Supervised Phrase Grounding
Figure 4 for MAF: Multimodal Alignment Framework for Weakly-Supervised Phrase Grounding
Viaarxiv icon

Sparse Quantized Spectral Clustering

Add code
Oct 03, 2020
Figure 1 for Sparse Quantized Spectral Clustering
Figure 2 for Sparse Quantized Spectral Clustering
Figure 3 for Sparse Quantized Spectral Clustering
Figure 4 for Sparse Quantized Spectral Clustering
Viaarxiv icon