Picture for Xiawu Zheng

Xiawu Zheng

Multi-branch Collaborative Learning Network for 3D Visual Grounding

Add code
Jul 10, 2024
Viaarxiv icon

Efficient Event Stream Super-Resolution with Recursive Multi-Branch Fusion

Add code
Jun 28, 2024
Viaarxiv icon

Local Manifold Learning for No-Reference Image Quality Assessment

Add code
Jun 27, 2024
Viaarxiv icon

Depth-Guided Semi-Supervised Instance Segmentation

Add code
Jun 25, 2024
Viaarxiv icon

VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models

Add code
Jun 14, 2024
Viaarxiv icon

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Add code
May 31, 2024
Figure 1 for Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Figure 2 for Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Figure 3 for Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Figure 4 for Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Viaarxiv icon

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

Add code
May 16, 2024
Viaarxiv icon

GraCo: Granularity-Controllable Interactive Segmentation

Add code
May 01, 2024
Viaarxiv icon

Cantor: Inspiring Multimodal Chain-of-Thought of MLLM

Add code
Apr 24, 2024
Figure 1 for Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
Figure 2 for Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
Figure 3 for Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
Figure 4 for Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
Viaarxiv icon

Multi-Modal Prompt Learning on Blind Image Quality Assessment

Add code
Apr 23, 2024
Figure 1 for Multi-Modal Prompt Learning on Blind Image Quality Assessment
Figure 2 for Multi-Modal Prompt Learning on Blind Image Quality Assessment
Figure 3 for Multi-Modal Prompt Learning on Blind Image Quality Assessment
Figure 4 for Multi-Modal Prompt Learning on Blind Image Quality Assessment
Viaarxiv icon