Picture for Kurt Keutzer

Kurt Keutzer

VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness

Add code
Jan 15, 2024
Viaarxiv icon

Learned Best-Effort LLM Serving

Add code
Jan 15, 2024
Figure 1 for Learned Best-Effort LLM Serving
Figure 2 for Learned Best-Effort LLM Serving
Figure 3 for Learned Best-Effort LLM Serving
Figure 4 for Learned Best-Effort LLM Serving
Viaarxiv icon

Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation

Add code
Dec 27, 2023
Figure 1 for Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation
Figure 2 for Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation
Figure 3 for Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation
Figure 4 for Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation
Viaarxiv icon

StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation

Add code
Dec 19, 2023
Viaarxiv icon

Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting

Add code
Dec 14, 2023
Figure 1 for Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting
Figure 2 for Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting
Figure 3 for Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting
Figure 4 for Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting
Viaarxiv icon

An LLM Compiler for Parallel Function Calling

Add code
Dec 07, 2023
Viaarxiv icon

MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration

Add code
Nov 16, 2023
Figure 1 for MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration
Figure 2 for MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration
Figure 3 for MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration
Figure 4 for MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration
Viaarxiv icon

EPIM: Efficient Processing-In-Memory Accelerators based on Epitome

Add code
Nov 12, 2023
Viaarxiv icon

Simple and Effective Input Reformulations for Translation

Add code
Nov 12, 2023
Viaarxiv icon

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Add code
Nov 07, 2023
Figure 1 for S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Figure 2 for S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Figure 3 for S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Figure 4 for S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Viaarxiv icon