Picture for Huan Li

Huan Li

MV-Actor: Aligning Multi-View Semantics and Spatial Awareness for Bimanual Manipulation

Add code
Jun 09, 2026
Viaarxiv icon

LATTEArena: An Evaluation Framework for LLM-powered Tabular Feature Engineering (Extended Version)

Add code
Jun 08, 2026
Viaarxiv icon

AR Forcing: Towards Long-Horizon Robot Navigation World Model

Add code
May 29, 2026
Viaarxiv icon

Bridging Classification and Reconstruction: Cooperative Time Series Anomaly Detection

Add code
May 28, 2026
Viaarxiv icon

MedMemoryBench: Benchmarking Agent Memory in Personalized Healthcare

Add code
May 12, 2026
Viaarxiv icon

Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models

Add code
May 10, 2026
Viaarxiv icon

HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference

Add code
Apr 07, 2026
Viaarxiv icon

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

Add code
Apr 07, 2026
Viaarxiv icon

See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video LLMs

Add code
Apr 07, 2026
Viaarxiv icon

Bridging Pixels and Words: Mask-Aware Local Semantic Fusion for Multimodal Media Verification

Add code
Mar 27, 2026
Viaarxiv icon