Picture for Fan Lai

Fan Lai

Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models

Add code
Jun 11, 2025
Viaarxiv icon

Single-agent or Multi-agent Systems? Why Not Both?

Add code
May 23, 2025
Viaarxiv icon

Tempo: Application-aware LLM Serving with Mixed SLO Requirements

Add code
Apr 24, 2025
Viaarxiv icon

Circinus: Efficient Query Planner for Compound ML Serving

Add code
Apr 23, 2025
Viaarxiv icon

DiSCo: Device-Server Collaborative LLM-Based Text Streaming Services

Add code
Feb 17, 2025
Viaarxiv icon

Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services

Add code
Apr 25, 2024
Figure 1 for Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services
Figure 2 for Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services
Figure 3 for Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services
Figure 4 for Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services
Viaarxiv icon

FedTrans: Efficient Federated Learning via Multi-Model Transformation

Add code
Apr 25, 2024
Viaarxiv icon

Learn To be Efficient: Build Structured Sparsity in Large Language Models

Add code
Feb 13, 2024
Figure 1 for Learn To be Efficient: Build Structured Sparsity in Large Language Models
Figure 2 for Learn To be Efficient: Build Structured Sparsity in Large Language Models
Figure 3 for Learn To be Efficient: Build Structured Sparsity in Large Language Models
Figure 4 for Learn To be Efficient: Build Structured Sparsity in Large Language Models
Viaarxiv icon

Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems

Add code
Jan 09, 2024
Figure 1 for Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems
Figure 2 for Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems
Figure 3 for Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems
Figure 4 for Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems
Viaarxiv icon

Venn: Resource Management Across Federated Learning Jobs

Add code
Dec 13, 2023
Figure 1 for Venn: Resource Management Across Federated Learning Jobs
Figure 2 for Venn: Resource Management Across Federated Learning Jobs
Figure 3 for Venn: Resource Management Across Federated Learning Jobs
Figure 4 for Venn: Resource Management Across Federated Learning Jobs
Viaarxiv icon