Picture for Sicheng Zhao

Sicheng Zhao

DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval

Add code
Jun 10, 2025
Viaarxiv icon

AdaTP: Attention-Debiased Token Pruning for Video Large Language Models

Add code
May 26, 2025
Viaarxiv icon

An Empirical Study on Configuring In-Context Learning Demonstrations for Unleashing MLLMs' Sentimental Perception Capability

Add code
May 22, 2025
Viaarxiv icon

Modality Reliability Guided Multimodal Recommendation

Add code
Apr 23, 2025
Viaarxiv icon

FastVID: Dynamic Density Pruning for Fast Video Large Language Models

Add code
Mar 14, 2025
Viaarxiv icon

LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs

Add code
Mar 14, 2025
Viaarxiv icon

Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation

Add code
Dec 18, 2024
Figure 1 for Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation
Figure 2 for Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation
Figure 3 for Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation
Figure 4 for Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation
Viaarxiv icon

From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision

Add code
Dec 15, 2024
Viaarxiv icon

HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator

Add code
Nov 26, 2024
Figure 1 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Figure 2 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Figure 3 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Figure 4 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Viaarxiv icon

TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval

Add code
Sep 02, 2024
Viaarxiv icon