Picture for Enze Xie

Enze Xie

Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts

Add code
Jul 16, 2024
Viaarxiv icon

Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

Add code
May 09, 2024
Viaarxiv icon

DriveCoT: Integrating Chain-of-Thought Reasoning with End-to-End Driving

Add code
Mar 25, 2024
Viaarxiv icon

Editing Massive Concepts in Text-to-Image Diffusion Models

Add code
Mar 20, 2024
Figure 1 for Editing Massive Concepts in Text-to-Image Diffusion Models
Figure 2 for Editing Massive Concepts in Text-to-Image Diffusion Models
Figure 3 for Editing Massive Concepts in Text-to-Image Diffusion Models
Figure 4 for Editing Massive Concepts in Text-to-Image Diffusion Models
Viaarxiv icon

TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model

Add code
Mar 15, 2024
Figure 1 for TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Figure 2 for TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Figure 3 for TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Figure 4 for TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Viaarxiv icon

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Add code
Mar 07, 2024
Figure 1 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 2 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 3 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 4 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Viaarxiv icon

Accelerating Diffusion Sampling with Optimized Time Steps

Add code
Feb 27, 2024
Figure 1 for Accelerating Diffusion Sampling with Optimized Time Steps
Figure 2 for Accelerating Diffusion Sampling with Optimized Time Steps
Figure 3 for Accelerating Diffusion Sampling with Optimized Time Steps
Figure 4 for Accelerating Diffusion Sampling with Optimized Time Steps
Viaarxiv icon

On the Expressive Power of a Variant of the Looped Transformer

Add code
Feb 21, 2024
Viaarxiv icon

Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation

Add code
Jan 30, 2024
Viaarxiv icon

CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects

Add code
Jan 18, 2024
Viaarxiv icon