Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:MNN: A Universal and Efficient Inference Engine

Feb 27, 2020

Xiaotang Jiang, Huan Wang, Yiliu Chen, Ziqi Wu, Lichuan Wang, Bin Zou, Yafeng Yang, Zongyang Cui, Yu Cai, Tianhang Yu(+2 more)

Figure 1 for MNN: A Universal and Efficient Inference Engine

Figure 2 for MNN: A Universal and Efficient Inference Engine

Figure 3 for MNN: A Universal and Efficient Inference Engine

Figure 4 for MNN: A Universal and Efficient Inference Engine

Share this with someone who'll enjoy it:

Abstract:Deploying deep learning models on mobile devices draws more and more attention recently. However, designing an efficient inference engine on devices is under the great challenges of model compatibility, device diversity, and resource limitation. To deal with these challenges, we propose Mobile Neural Network (MNN), a universal and efficient inference engine tailored to mobile applications. In this paper, the contributions of MNN include: (1) presenting a mechanism called pre-inference that manages to conduct runtime optimization; (2)deliveringthorough kernel optimization on operators to achieve optimal computation performance; (3) introducing backend abstraction module which enables hybrid scheduling and keeps the engine lightweight. Extensive benchmark experiments demonstrate that MNN performs favorably against other popular lightweight deep learning frameworks. MNN is available to public at: https://github.com/alibaba/MNN.

* Accepted by MLSys 2020

View paper on

Share this with someone who'll enjoy it:

Title:MNN: A Universal and Efficient Inference Engine

Paper and Code