Picture for Zhao Wang

Zhao Wang

CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation

Add code
Aug 10, 2025
Viaarxiv icon

OMS: On-the-fly, Multi-Objective, Self-Reflective Ad Keyword Generation via LLM Agent

Add code
Jul 03, 2025
Viaarxiv icon

DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation

Add code
Apr 21, 2025
Figure 1 for DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation
Figure 2 for DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation
Figure 3 for DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation
Figure 4 for DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation
Viaarxiv icon

DreamFuse: Adaptive Image Fusion with Diffusion Transformer

Add code
Apr 11, 2025
Viaarxiv icon

DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode

Add code
Mar 17, 2025
Viaarxiv icon

Unlocking Pretrained LLMs for Motion-Related Multimodal Generation: A Fine-Tuning Approach to Unify Diffusion and Next-Token Prediction

Add code
Mar 08, 2025
Viaarxiv icon

Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs

Add code
Mar 04, 2025
Viaarxiv icon

Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting

Add code
Feb 13, 2025
Figure 1 for Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting
Figure 2 for Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting
Figure 3 for Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting
Figure 4 for Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting
Viaarxiv icon

AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance

Add code
Feb 12, 2025
Viaarxiv icon

Adaptive Budget Optimization for Multichannel Advertising Using Combinatorial Bandits

Add code
Feb 05, 2025
Viaarxiv icon