Picture for Xintao Wang

Xintao Wang

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Add code
Nov 13, 2025
Figure 1 for MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
Figure 2 for MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
Figure 3 for MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
Figure 4 for MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
Viaarxiv icon

Simulating the Visual World with Artificial Intelligence: A Roadmap

Add code
Nov 11, 2025
Viaarxiv icon

RelightMaster: Precise Video Relighting with Multi-plane Light Images

Add code
Nov 09, 2025
Viaarxiv icon

OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes

Add code
Oct 30, 2025
Figure 1 for OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes
Figure 2 for OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes
Figure 3 for OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes
Figure 4 for OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes
Viaarxiv icon

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning

Add code
Oct 29, 2025
Viaarxiv icon

GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping

Add code
Oct 25, 2025
Viaarxiv icon

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Add code
Sep 11, 2025
Viaarxiv icon

Curse of Knowledge: When Complex Evaluation Context Benefits yet Biases LLM Judges

Add code
Sep 03, 2025
Viaarxiv icon

SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution

Add code
Jun 24, 2025
Viaarxiv icon

FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation

Add code
Jun 23, 2025
Viaarxiv icon