Picture for Xiaoming Wei

Xiaoming Wei

Meituan

ECHO: Towards Emotionally Appropriate and Contextually Aware Interactive Head Generation

Add code
Mar 18, 2026
Viaarxiv icon

WildActor: Unconstrained Identity-Preserving Video Generation

Add code
Feb 28, 2026
Viaarxiv icon

U-Mind: A Unified Framework for Real-Time Multimodal Interaction with Audiovisual Generation

Add code
Feb 27, 2026
Viaarxiv icon

PosterOmni: Generalized Artistic Poster Creation via Task Distillation and Unified Reward Feedback

Add code
Feb 12, 2026
Viaarxiv icon

LUVE : Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts

Add code
Feb 12, 2026
Viaarxiv icon

Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Add code
Feb 03, 2026
Viaarxiv icon

Forge-and-Quench: Enhancing Image Generation for Higher Fidelity in Unified Multimodal Models

Add code
Jan 08, 2026
Viaarxiv icon

Active Intelligence in Video Avatars via Closed-loop World Modeling

Add code
Dec 23, 2025
Viaarxiv icon

UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models

Add code
Dec 12, 2025
Figure 1 for UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models
Figure 2 for UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models
Figure 3 for UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models
Figure 4 for UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models
Viaarxiv icon

LongCat-Image Technical Report

Add code
Dec 08, 2025
Figure 1 for LongCat-Image Technical Report
Figure 2 for LongCat-Image Technical Report
Figure 3 for LongCat-Image Technical Report
Figure 4 for LongCat-Image Technical Report
Viaarxiv icon