Hanxun Yu

N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models

Dec 18, 2025
Via arXiv

StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding

Dec 14, 2025
Via arXiv

Physical Adversarial Attack meets Computer Vision: A Decade Survey

Sep 30, 2022
Via arXiv