Haofan Wang

Unlocking the Latent Canvas: Eliciting and Benchmarking Symbolic Visual Expression in LLMs

Mar 15, 2026

SIGMA: Selective-Interleaved Generation with Multi-Attribute Tokens

Feb 07, 2026

StableWorld: Towards Stable and Consistent Long Interactive Video Generation

Jan 21, 2026

OmniPSD: Layered PSD Generation with Diffusion Transformer

Dec 10, 2025

EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering

May 30, 2025

GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains

May 24, 2025

RepText: Rendering Visual Text via Replicating

Apr 28, 2025

InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework

Apr 16, 2025

EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer

Mar 10, 2025

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

Nov 15, 2024