Yuhang Han

STaR-KV: Spatio-Temporal Adaptive Re-weighting for KV Cache Compression in GUI Vision-Language Models

Jun 01, 2026
Via arXiv

FlexDraft: Flexible Speculative Decoding via Attention Tuning and Bonus-Guided Calibration

May 19, 2026
Via arXiv

Bridging Visual Representation and Reinforcement Learning from Verifiable Rewards in Large Vision-Language Models

Mar 28, 2026
Via arXiv

Chameleon: Episodic Memory for Long-Horizon Robotic Manipulation

Mar 25, 2026
Via arXiv

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Jan 27, 2026
Via arXiv

Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration

Jan 09, 2025
Via arXiv

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Nov 26, 2024
Via arXiv

DRAMA: An Efficient End-to-end Motion Planner for Autonomous Driving with Mamba

Aug 07, 2024
Via arXiv

ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties

May 01, 2024
Via arXiv

ControlMTR: Control-Guided Motion Transformer with Scene-Compliant Intention Points for Feasible Motion Prediction

Apr 17, 2024
Via arXiv