Saeid Ghafouri

Queen Mary University of London, University of South Carolina

ConfigSpec: Profiling-Based Configuration Selection for Distributed Edge-Cloud Speculative LLM Serving

Apr 08, 2026

WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching

Jan 15, 2026

SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving

Jun 11, 2025

IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency

Aug 24, 2023

Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems

Apr 24, 2023

Opinion Leader Detection in Online Social Networks Based on Output and Input Links

Aug 28, 2022