Weining Wang

3SGen: Unified Subject, Style, and Structure-Driven Image Generation with Adaptive Task-specific Memory

Dec 22, 2025
Via arXiv

TTP: Test-Time Padding for Adversarial Detection and Robust Adaptation on Vision-Language Models

Dec 18, 2025
Via arXiv

ProAV-DiT: A Projected Latent Diffusion Transformer for Efficient Synchronized Audio-Video Generation

Nov 15, 2025
Via arXiv

PhysCorr: Dual-Reward DPO for Physics-Constrained Text-to-Video Generation with Automated Preference Selection

Nov 06, 2025
Via arXiv

Learning Unknown Spoof Prompts for Generalized Face Anti-Spoofing Using Only Real Face Images

May 06, 2025
Via arXiv

Learning Knowledge-based Prompts for Robust 3D Mask Presentation Attack Detection

May 06, 2025
Via arXiv

AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion

Mar 10, 2025
Via arXiv

COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based Video Generation

Oct 02, 2024
Via arXiv

MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation

Oct 02, 2024
Via arXiv

GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER

Sep 23, 2023
Via arXiv