Xiaoming Wei

Meituan

WildActor: Unconstrained Identity-Preserving Video Generation

Feb 28, 2026
Via arXiv

U-Mind: A Unified Framework for Real-Time Multimodal Interaction with Audiovisual Generation

Feb 27, 2026
Via arXiv

PosterOmni: Generalized Artistic Poster Creation via Task Distillation and Unified Reward Feedback

Feb 12, 2026
Via arXiv

LUVE: Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts

Feb 12, 2026
Via arXiv

Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Feb 03, 2026
Via arXiv

Forge-and-Quench: Enhancing Image Generation for Higher Fidelity in Unified Multimodal Models

Jan 08, 2026
Via arXiv

Active Intelligence in Video Avatars via Closed-loop World Modeling

Dec 23, 2025
Via arXiv

UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models

Dec 12, 2025
Via arXiv

LongCat-Image Technical Report

Dec 08, 2025
Via arXiv

InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing

Aug 19, 2025
Via arXiv