Picture for Yicheng Ji

Yicheng Ji

Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models

Add code
May 10, 2026
Viaarxiv icon

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

Add code
Apr 07, 2026
Viaarxiv icon

See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video LLMs

Add code
Apr 07, 2026
Viaarxiv icon

ParallelVLM: Lossless Video-LLM Acceleration with Visual Alignment Aware Parallel Speculative Decoding

Add code
Mar 23, 2026
Viaarxiv icon