Alert button

CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers

Apr 10, 2024
Longwei Zou, Qingyang Wang, Han Zhao, Jiangang Kong, Yi Yang, Yangdong Deng

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: