Xiaolei Chen

Training-Free Pyramid Token Pruning for Efficient Large Vision-Language Models via Region, Token, and Instruction-Guided Importance

Sep 19, 2025
Via arXiv

HERO: Rethinking Visual Token Early Dropping in High-Resolution Large Vision-Language Models

Sep 16, 2025
Via arXiv

Global Semantic-Guided Sub-image Feature Weight Allocation in High-Resolution Large Vision-Language Models

Jan 24, 2025
Via arXiv

A Spatial-Temporal Dual-Mode Mixed Flow Network for Panoramic Video Salient Object Detection

Oct 13, 2023
Via arXiv