Alert button
Picture for Mehran Salmani

Mehran Salmani

Alert button

Iran University of Science and Technology

IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency

Add code
Bookmark button
Alert button
Aug 24, 2023
Saeid Ghafouri, Kamran Razavi, Mehran Salmani, Alireza Sanaee, Tania Lorido-Botran, Lin Wang, Joseph Doyle, Pooyan Jamshidi

Viaarxiv icon

Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems

Add code
Bookmark button
Alert button
Apr 24, 2023
Mehran Salmani, Saeid Ghafouri, Alireza Sanaee, Kamran Razavi, Max Mühlhäuser, Joseph Doyle, Pooyan Jamshidi, Mohsen Sharifi

Figure 1 for Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems
Figure 2 for Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems
Figure 3 for Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems
Figure 4 for Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems
Viaarxiv icon