Alert button

Opara: Exploiting Operator Parallelism for Expediting DNN Inference on GPUs

Dec 16, 2023
Aodong Chen, Fei Xu, Li Han, Yuan Dong, Li Chen, Zhi Zhou, Fangming Liu

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: