Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Wei Niu

Achieving Real-Time LiDAR 3D Object Detection on a Mobile Device


Dec 26, 2020
Pu Zhao, Wei Niu, Geng Yuan, Yuxuan Cai, Hsin-Hsuan Sung, Wujie Wen, Sijia Liu, Xipeng Shen, Bin Ren, Yanzhi Wang, Xue Lin


  Access Paper or Ask Questions

6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration


Dec 01, 2020
Zhengang Li, Geng Yuan, Wei Niu, Yanyu Li, Pu Zhao, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, Xue Lin


  Access Paper or Ask Questions

An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning


Nov 20, 2020
Chengming Zhang, Geng Yuan, Wei Niu, Jiannan Tian, Sian Jin, Donglin Zhuang, Zhe Jiang, Yanzhi Wang, Bin Ren, Shuaiwen Leon Song, Dingwen Tao

* 11 pages, 13 figures, 2 tables 

  Access Paper or Ask Questions

Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization


Sep 15, 2020
Wei Niu, Zhenglun Kong, Geng Yuan, Weiwen Jiang, Jiexiong Guan, Caiwen Ding, Pu Zhao, Sijia Liu, Bin Ren, Yanzhi Wang


  Access Paper or Ask Questions

YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design


Sep 12, 2020
Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang


  Access Paper or Ask Questions

Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices


Jul 20, 2020
Wei Niu, Mengshu Sun, Zhengang Li, Jou-An Chen, Jiexiong Guan, Xipeng Shen, Yanzhi Wang, Xue Lin, Bin Ren


  Access Paper or Ask Questions

Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization


Apr 22, 2020
Wei Niu, Pu Zhao, Zheng Zhan, Xue Lin, Yanzhi Wang, Bin Ren

* accepted by the IJCAI-PRICAI 2020 Demonstrations Track 

  Access Paper or Ask Questions

A Privacy-Preserving DNN Pruning and Mobile Acceleration Framework


Mar 13, 2020
Zheng Zhan, Yifan Gong, Zhengang Li, Pu Zhao, Xiaolong Ma, Wei Niu, Xiaolin Xu, Bin Ren, Yanzhi Wang, Xue Lin


  Access Paper or Ask Questions

BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted Regularization Method


Feb 22, 2020
Xiaolong Ma, Zhengang Li, Yifan Gong, Tianyun Zhang, Wei Niu, Zheng Zhan, Pu Zhao, Jian Tang, Xue Lin, Bin Ren, Yanzhi Wang


  Access Paper or Ask Questions

An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices


Feb 22, 2020
Xiaolong Ma, Wei Niu, Tianyun Zhang, Sijia Liu, Sheng Lin, Hongjia Li, Xiang Chen, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang

* arXiv admin note: text overlap with arXiv:1909.05073 

  Access Paper or Ask Questions

RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition


Feb 19, 2020
Peiyan Dong, Siyue Wang, Wei Niu, Chengming Zhang, Sheng Lin, Zhengang Li, Yifan Gong, Bin Ren, Xue Lin, Yanzhi Wang, Dingwen Tao


  Access Paper or Ask Questions

Attentional Speech Recognition Models Misbehave on Out-of-domain Utterances


Feb 12, 2020
Phillip Keung, Wei Niu, Yichao Lu, Julian Salazar, Vikas Bhardwaj

* Artifacts like our filtered Audio BNC dataset can be found at https://github.com/aws-samples/seq2seq-asr-misbehaves 

  Access Paper or Ask Questions

PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning


Jan 22, 2020
Wei Niu, Xiaolong Ma, Sheng Lin, Shihao Wang, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren

* To be published in the Proceedings of Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 20) 

  Access Paper or Ask Questions

PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-time Execution on Mobile Devices


Sep 12, 2019
Xiaolong Ma, Fu-Ming Guo, Wei Niu, Xue Lin, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang


  Access Paper or Ask Questions

26ms Inference Time for ResNet-50: Towards Real-Time Execution of all DNNs on Smartphone


May 02, 2019
Wei Niu, Xiaolong Ma, Yanzhi Wang, Bin Ren


  Access Paper or Ask Questions