Picture for Xiangtai Li

Xiangtai Li

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions

Add code
Jun 16, 2025
Viaarxiv icon

Omni-AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented for Efficient Long Video Understanding

Add code
Jun 16, 2025
Viaarxiv icon

CyberV: Cybernetics for Test-time Scaling in Video Understanding

Add code
Jun 09, 2025
Figure 1 for CyberV: Cybernetics for Test-time Scaling in Video Understanding
Figure 2 for CyberV: Cybernetics for Test-time Scaling in Video Understanding
Figure 3 for CyberV: Cybernetics for Test-time Scaling in Video Understanding
Figure 4 for CyberV: Cybernetics for Test-time Scaling in Video Understanding
Viaarxiv icon

DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers

Add code
May 30, 2025
Viaarxiv icon

Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models

Add code
May 30, 2025
Figure 1 for Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models
Figure 2 for Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models
Figure 3 for Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models
Figure 4 for Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models
Viaarxiv icon

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

Add code
May 29, 2025
Viaarxiv icon

PixelThink: Towards Efficient Chain-of-Pixel Reasoning

Add code
May 29, 2025
Viaarxiv icon

So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection

Add code
May 24, 2025
Figure 1 for So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection
Figure 2 for So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection
Figure 3 for So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection
Figure 4 for So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection
Viaarxiv icon

Conditional Panoramic Image Generation via Masked Autoregressive Modeling

Add code
May 22, 2025
Figure 1 for Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Figure 2 for Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Figure 3 for Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Figure 4 for Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Viaarxiv icon

BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation

Add code
May 19, 2025
Figure 1 for BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation
Figure 2 for BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation
Figure 3 for BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation
Figure 4 for BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation
Viaarxiv icon