Picture for Huan Li

Huan Li

See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video LLMs

Add code
Apr 07, 2026
Viaarxiv icon

HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference

Add code
Apr 07, 2026
Viaarxiv icon

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

Add code
Apr 07, 2026
Viaarxiv icon

Bridging Pixels and Words: Mask-Aware Local Semantic Fusion for Multimodal Media Verification

Add code
Mar 27, 2026
Viaarxiv icon

ParallelVLM: Lossless Video-LLM Acceleration with Visual Alignment Aware Parallel Speculative Decoding

Add code
Mar 23, 2026
Viaarxiv icon

MeMix: Writing Less, Remembering More for Streaming 3D Reconstruction

Add code
Mar 16, 2026
Viaarxiv icon

VERA: Identifying and Leveraging Visual Evidence Retrieval Heads in Long-Context Understanding

Add code
Feb 09, 2026
Viaarxiv icon

Variance-Adaptive Muon: Accelerating LLM Pretraining with NSR-Modulated and Variance-Scaled Momentum

Add code
Jan 21, 2026
Viaarxiv icon

Convergence Rate Analysis of the AdamW-Style Shampoo: Unifying One-sided and Two-Sided Preconditioning

Add code
Jan 12, 2026
Viaarxiv icon

SafeLoad: Efficient Admission Control Framework for Identifying Memory-Overloading Queries in Cloud Data Warehouses

Add code
Jan 05, 2026
Viaarxiv icon