Picture for Ivan Chelombiev

Ivan Chelombiev

SparQ Attention: Bandwidth-Efficient LLM Inference

Add code
Dec 08, 2023
Viaarxiv icon

Towards Structured Dynamic Sparse Pre-Training of BERT

Add code
Aug 13, 2021
Figure 1 for Towards Structured Dynamic Sparse Pre-Training of BERT
Figure 2 for Towards Structured Dynamic Sparse Pre-Training of BERT
Figure 3 for Towards Structured Dynamic Sparse Pre-Training of BERT
Figure 4 for Towards Structured Dynamic Sparse Pre-Training of BERT
Viaarxiv icon

GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures

Add code
Jun 10, 2021
Figure 1 for GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures
Figure 2 for GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures
Figure 3 for GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures
Figure 4 for GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures
Viaarxiv icon

Adaptive Estimators Show Information Compression in Deep Neural Networks

Add code
Feb 24, 2019
Figure 1 for Adaptive Estimators Show Information Compression in Deep Neural Networks
Figure 2 for Adaptive Estimators Show Information Compression in Deep Neural Networks
Figure 3 for Adaptive Estimators Show Information Compression in Deep Neural Networks
Figure 4 for Adaptive Estimators Show Information Compression in Deep Neural Networks
Viaarxiv icon