Picture for Hanxun Yu

Hanxun Yu

VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration

Add code
Jan 30, 2026
Viaarxiv icon

N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models

Add code
Dec 18, 2025
Viaarxiv icon

StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding

Add code
Dec 14, 2025
Viaarxiv icon

Physical Adversarial Attack meets Computer Vision: A Decade Survey

Add code
Sep 30, 2022
Figure 1 for Physical Adversarial Attack meets Computer Vision: A Decade Survey
Figure 2 for Physical Adversarial Attack meets Computer Vision: A Decade Survey
Figure 3 for Physical Adversarial Attack meets Computer Vision: A Decade Survey
Figure 4 for Physical Adversarial Attack meets Computer Vision: A Decade Survey
Viaarxiv icon