Picture for Vahid Partovi Nia

Vahid Partovi Nia

Huawei Noah's Ark Lab

Kronecker Decomposition for GPT Compression

Add code
Oct 15, 2021
Figure 1 for Kronecker Decomposition for GPT Compression
Figure 2 for Kronecker Decomposition for GPT Compression
Figure 3 for Kronecker Decomposition for GPT Compression
Figure 4 for Kronecker Decomposition for GPT Compression
Viaarxiv icon

Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition

Add code
Sep 29, 2021
Figure 1 for Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition
Figure 2 for Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition
Figure 3 for Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition
Figure 4 for Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition
Viaarxiv icon

iRNN: Integer-only Recurrent Neural Network

Add code
Sep 20, 2021
Figure 1 for iRNN: Integer-only Recurrent Neural Network
Figure 2 for iRNN: Integer-only Recurrent Neural Network
Figure 3 for iRNN: Integer-only Recurrent Neural Network
Figure 4 for iRNN: Integer-only Recurrent Neural Network
Viaarxiv icon

KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation

Add code
Sep 13, 2021
Figure 1 for KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation
Figure 2 for KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation
Figure 3 for KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation
Figure 4 for KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation
Viaarxiv icon

$S^3$: Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks

Add code
Jul 07, 2021
Figure 1 for $S^3$: Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks
Figure 2 for $S^3$: Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks
Figure 3 for $S^3$: Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks
Figure 4 for $S^3$: Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks
Viaarxiv icon

A Twin Neural Model for Uplift

Add code
May 11, 2021
Figure 1 for A Twin Neural Model for Uplift
Figure 2 for A Twin Neural Model for Uplift
Figure 3 for A Twin Neural Model for Uplift
Figure 4 for A Twin Neural Model for Uplift
Viaarxiv icon

Tensor train decompositions on recurrent networks

Add code
Jun 09, 2020
Figure 1 for Tensor train decompositions on recurrent networks
Figure 2 for Tensor train decompositions on recurrent networks
Figure 3 for Tensor train decompositions on recurrent networks
Figure 4 for Tensor train decompositions on recurrent networks
Viaarxiv icon

Clustering Causal Additive Noise Models

Add code
Jun 08, 2020
Figure 1 for Clustering Causal Additive Noise Models
Figure 2 for Clustering Causal Additive Noise Models
Figure 3 for Clustering Causal Additive Noise Models
Figure 4 for Clustering Causal Additive Noise Models
Viaarxiv icon

Batch Normalization in Quantized Networks

Add code
Apr 29, 2020
Viaarxiv icon

Importance of Data Loading Pipeline in Training Deep Neural Networks

Add code
Apr 21, 2020
Figure 1 for Importance of Data Loading Pipeline in Training Deep Neural Networks
Figure 2 for Importance of Data Loading Pipeline in Training Deep Neural Networks
Figure 3 for Importance of Data Loading Pipeline in Training Deep Neural Networks
Figure 4 for Importance of Data Loading Pipeline in Training Deep Neural Networks
Viaarxiv icon