Xu Sun

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

May 29, 2025
Via arXiv

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

May 28, 2025
Via arXiv

TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos

Apr 24, 2025
Via arXiv

UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?

Mar 13, 2025
Via arXiv

Generative Frame Sampler for Long Video Understanding

Mar 12, 2025
Via arXiv

Next Block Prediction: Video Generation via Semi-Autoregressive Modeling

Feb 12, 2025
Via arXiv

VidTwin: Video VAE with Decoupled Structure and Dynamics

Dec 23, 2024
Via arXiv

PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension

Dec 16, 2024
Via arXiv

Hyperspectral Image Cross-Domain Object Detection Method based on Spectral-Spatial Feature Alignment

Nov 25, 2024
Via arXiv

Unveiling Molecular Secrets: An LLM-Augmented Linear Model for Explainable and Calibratable Molecular Property Prediction

Oct 11, 2024
Via arXiv