Wei Niu

Efficient Pruning of Large Language Model with Adaptive Estimation Fusion

Mar 16, 2024

SoD$^2$: Statically Optimizing Dynamic Deep Neural Network

Feb 29, 2024

EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge

Feb 16, 2024

Towards Artificial General Intelligence (AGI) in the Internet of Things (IoT): Opportunities and Challenges

Sep 14, 2023

Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting

Mar 15, 2023

SparCL: Sparse Continual Learning on the Edge

Sep 20, 2022

Survey: Exploiting Data Redundancy for Optimization of Deep Learning

Aug 29, 2022

Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution

Jul 25, 2022

Real-Time Portrait Stylization on the Edge

Jun 02, 2022

SPViT: Enabling Faster Vision Transformers via Soft Token Pruning

Dec 27, 2021