Kai Lu

WindowQuant: Mixed-Precision KV Cache Quantization based on Window-Level Similarity for VLMs Inference Optimization

May 04, 2026
Via arXiv

ScoutAttention: Efficient KV Cache Offloading via Layer-Ahead CPU Pre-computation for LLM Inference

Mar 28, 2026
Via arXiv

CycleVLA: Proactive Self-Correcting Vision-Language-Action Models via Subtask Backtracking and Minimum Bayes Risk Decoding

Jan 05, 2026
Via arXiv

DeepGo: Predictive Directed Greybox Fuzzing

Jul 29, 2025
Via arXiv

MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts

Jun 09, 2025
Via arXiv

MADLLM: Multivariate Anomaly Detection via Pre-trained LLMs

Apr 13, 2025
Via arXiv

RUNA: Object-level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations

Mar 28, 2025
Via arXiv

RefSAM3D: Adapting SAM with Cross-modal Reference for 3D Medical Image Segmentation

Dec 07, 2024
Via arXiv

Hammer: Towards Efficient Hot-Cold Data Identification via Online Learning

Nov 22, 2024
Via arXiv

InteLiPlan: Interactive Lightweight LLM-Based Planner for Domestic Robot Autonomy

Sep 22, 2024
Via arXiv