Picture for Zechao Li

Zechao Li

SpectralGap: Graph-Level Out-of-Distribution Detection via Laplacian Eigenvalue Gaps

Add code
May 21, 2025
Viaarxiv icon

Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking

Add code
May 19, 2025
Viaarxiv icon

VideoExpert: Augmented LLM for Temporal-Sensitive Video Understanding

Add code
Apr 10, 2025
Viaarxiv icon

Visual Position Prompt for MLLM based Visual Grounding

Add code
Mar 19, 2025
Viaarxiv icon

A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models

Add code
Mar 17, 2025
Viaarxiv icon

OT-DETECTOR: Delving into Optimal Transport for Zero-shot Out-of-Distribution Detection

Add code
Mar 09, 2025
Viaarxiv icon

3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly

Add code
Feb 09, 2025
Viaarxiv icon

AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation

Add code
Dec 23, 2024
Viaarxiv icon

Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception

Add code
Dec 18, 2024
Figure 1 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Figure 2 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Figure 3 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Figure 4 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Viaarxiv icon

Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection

Add code
Dec 02, 2024
Viaarxiv icon