Alert button
Picture for Geng Yuan

Geng Yuan

Alert button

An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning

Add code
Bookmark button
Alert button
Nov 20, 2020
Chengming Zhang, Geng Yuan, Wei Niu, Jiannan Tian, Sian Jin, Donglin Zhuang, Zhe Jiang, Yanzhi Wang, Bin Ren, Shuaiwen Leon Song, Dingwen Tao

Figure 1 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Figure 2 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Figure 3 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Figure 4 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Viaarxiv icon

Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization

Add code
Bookmark button
Alert button
Sep 15, 2020
Wei Niu, Zhenglun Kong, Geng Yuan, Weiwen Jiang, Jiexiong Guan, Caiwen Ding, Pu Zhao, Sijia Liu, Bin Ren, Yanzhi Wang

Figure 1 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Figure 2 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Figure 3 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Figure 4 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Viaarxiv icon

YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design

Add code
Bookmark button
Alert button
Sep 12, 2020
Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang

Figure 1 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Figure 2 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Figure 3 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Figure 4 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Viaarxiv icon

SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency

Add code
Bookmark button
Alert button
Jan 23, 2020
Zhengang Li, Yifan Gong, Xiaolong Ma, Sijia Liu, Mengshu Sun, Zheng Zhan, Zhenglun Kong, Geng Yuan, Yanzhi Wang

Figure 1 for SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency
Figure 2 for SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency
Figure 3 for SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency
Figure 4 for SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency
Viaarxiv icon

A SOT-MRAM-based Processing-In-Memory Engine for Highly Compressed DNN Implementation

Add code
Bookmark button
Alert button
Nov 24, 2019
Geng Yuan, Xiaolong Ma, Sheng Lin, Zhengang Li, Caiwen Ding

Figure 1 for A SOT-MRAM-based Processing-In-Memory Engine for Highly Compressed DNN Implementation
Figure 2 for A SOT-MRAM-based Processing-In-Memory Engine for Highly Compressed DNN Implementation
Figure 3 for A SOT-MRAM-based Processing-In-Memory Engine for Highly Compressed DNN Implementation
Figure 4 for A SOT-MRAM-based Processing-In-Memory Engine for Highly Compressed DNN Implementation
Viaarxiv icon

An Ultra-Efficient Memristor-Based DNN Framework with Structured Weight Pruning and Quantization Using ADMM

Add code
Bookmark button
Alert button
Aug 29, 2019
Geng Yuan, Xiaolong Ma, Caiwen Ding, Sheng Lin, Tianyun Zhang, Zeinab S. Jalali, Yilong Zhao, Li Jiang, Sucheta Soundarajan, Yanzhi Wang

Figure 1 for An Ultra-Efficient Memristor-Based DNN Framework with Structured Weight Pruning and Quantization Using ADMM
Figure 2 for An Ultra-Efficient Memristor-Based DNN Framework with Structured Weight Pruning and Quantization Using ADMM
Figure 3 for An Ultra-Efficient Memristor-Based DNN Framework with Structured Weight Pruning and Quantization Using ADMM
Figure 4 for An Ultra-Efficient Memristor-Based DNN Framework with Structured Weight Pruning and Quantization Using ADMM
Viaarxiv icon

Tiny but Accurate: A Pruned, Quantized and Optimized Memristor Crossbar Framework for Ultra Efficient DNN Implementation

Add code
Bookmark button
Alert button
Aug 27, 2019
Xiaolong Ma, Geng Yuan, Sheng Lin, Caiwen Ding, Fuxun Yu, Tao Liu, Wujie Wen, Xiang Chen, Yanzhi Wang

Figure 1 for Tiny but Accurate: A Pruned, Quantized and Optimized Memristor Crossbar Framework for Ultra Efficient DNN Implementation
Figure 2 for Tiny but Accurate: A Pruned, Quantized and Optimized Memristor Crossbar Framework for Ultra Efficient DNN Implementation
Figure 3 for Tiny but Accurate: A Pruned, Quantized and Optimized Memristor Crossbar Framework for Ultra Efficient DNN Implementation
Figure 4 for Tiny but Accurate: A Pruned, Quantized and Optimized Memristor Crossbar Framework for Ultra Efficient DNN Implementation
Viaarxiv icon

Non-structured DNN Weight Pruning Considered Harmful

Add code
Bookmark button
Alert button
Jul 03, 2019
Yanzhi Wang, Shaokai Ye, Zhezhi He, Xiaolong Ma, Linfeng Zhang, Sheng Lin, Geng Yuan, Sia Huat Tan, Zhengang Li, Deliang Fan, Xuehai Qian, Xue Lin, Kaisheng Ma

Figure 1 for Non-structured DNN Weight Pruning Considered Harmful
Figure 2 for Non-structured DNN Weight Pruning Considered Harmful
Figure 3 for Non-structured DNN Weight Pruning Considered Harmful
Figure 4 for Non-structured DNN Weight Pruning Considered Harmful
Viaarxiv icon

Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM

Add code
Bookmark button
Alert button
May 02, 2019
Sheng Lin, Xiaolong Ma, Shaokai Ye, Geng Yuan, Kaisheng Ma, Yanzhi Wang

Figure 1 for Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM
Figure 2 for Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM
Figure 3 for Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM
Figure 4 for Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM
Viaarxiv icon

ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning

Add code
Bookmark button
Alert button
Apr 30, 2019
Xiaolong Ma, Geng Yuan, Sheng Lin, Zhengang Li, Hao Sun, Yanzhi Wang

Figure 1 for ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning
Figure 2 for ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning
Figure 3 for ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning
Figure 4 for ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning
Viaarxiv icon