Alert button
Picture for Yangqing Jia

Yangqing Jia

Alert button

DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Mar 07, 2024
Muyang Li, Tianle Cai, Jiaxin Cao, Qinsheng Zhang, Han Cai, Junjie Bai, Yangqing Jia, Ming-Yu Liu, Kai Li, Song Han

Viaarxiv icon

Characterizing Deep Learning Training Workloads on Alibaba-PAI

Oct 14, 2019
Mengdi Wang, Chen Meng, Guoping Long, Chuan Wu, Jun Yang, Wei Lin, Yangqing Jia

Figure 1 for Characterizing Deep Learning Training Workloads on Alibaba-PAI
Figure 2 for Characterizing Deep Learning Training Workloads on Alibaba-PAI
Figure 3 for Characterizing Deep Learning Training Workloads on Alibaba-PAI
Figure 4 for Characterizing Deep Learning Training Workloads on Alibaba-PAI
Viaarxiv icon

ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation

Dec 21, 2018
Xiaoliang Dai, Peizhao Zhang, Bichen Wu, Hongxu Yin, Fei Sun, Yanghan Wang, Marat Dukhan, Yunqing Hu, Yiming Wu, Yangqing Jia, Peter Vajda, Matt Uyttendaele, Niraj K. Jha

Figure 1 for ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation
Figure 2 for ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation
Figure 3 for ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation
Figure 4 for ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation
Viaarxiv icon

FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search

Dec 14, 2018
Bichen Wu, Xiaoliang Dai, Peizhao Zhang, Yanghan Wang, Fei Sun, Yiming Wu, Yuandong Tian, Peter Vajda, Yangqing Jia, Kurt Keutzer

Figure 1 for FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
Figure 2 for FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
Figure 3 for FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
Figure 4 for FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
Viaarxiv icon

Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications

Nov 29, 2018
Jongsoo Park, Maxim Naumov, Protonu Basu, Summer Deng, Aravind Kalaiah, Daya Khudia, James Law, Parth Malani, Andrey Malevich, Satish Nadathur, Juan Pino, Martin Schatz, Alexander Sidorov, Viswanath Sivakumar, Andrew Tulloch, Xiaodong Wang, Yiming Wu, Hector Yuen, Utku Diril, Dmytro Dzhulgakov, Kim Hazelwood, Bill Jia, Yangqing Jia, Lin Qiao, Vijay Rao, Nadav Rotem, Sungjoo Yoo, Mikhail Smelyanskiy

Figure 1 for Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Figure 2 for Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Figure 3 for Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Figure 4 for Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Viaarxiv icon

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

Apr 30, 2018
Priya Goyal, Piotr Dollár, Ross Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, Kaiming He

Figure 1 for Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Figure 2 for Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Figure 3 for Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Figure 4 for Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Viaarxiv icon

High performance ultra-low-precision convolutions on mobile devices

Dec 06, 2017
Andrew Tulloch, Yangqing Jia

Figure 1 for High performance ultra-low-precision convolutions on mobile devices
Viaarxiv icon

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Mar 16, 2016
Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mane, Rajat Monga, Sherry Moore, Derek Murray, Chris Olah, Mike Schuster, Jonathon Shlens, Benoit Steiner, Ilya Sutskever, Kunal Talwar, Paul Tucker, Vincent Vanhoucke, Vijay Vasudevan, Fernanda Viegas, Oriol Vinyals, Pete Warden, Martin Wattenberg, Martin Wicke, Yuan Yu, Xiaoqiang Zheng

Figure 1 for TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Figure 2 for TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Figure 3 for TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Figure 4 for TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Viaarxiv icon