Picture for Xiaoming Wei

Xiaoming Wei

Meituan

DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution

Add code
Jul 01, 2025
Viaarxiv icon

PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Add code
Jun 12, 2025
Viaarxiv icon

LLIA -- Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models

Add code
Jun 06, 2025
Viaarxiv icon

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Add code
May 28, 2025
Viaarxiv icon

LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation

Add code
Apr 15, 2025
Viaarxiv icon

Omni-Dish: Photorealistic and Faithful Image Generation and Editing for Arbitrary Chinese Dishes

Add code
Apr 14, 2025
Viaarxiv icon

Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models

Add code
Jan 28, 2025
Viaarxiv icon

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

Add code
Jan 14, 2025
Viaarxiv icon

CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder

Add code
Dec 23, 2024
Viaarxiv icon

High-Resolution Image Synthesis via Next-Token Prediction

Add code
Nov 22, 2024
Viaarxiv icon