Alert button
Picture for Enze Xie

Enze Xie

Alert button

DriveCoT: Integrating Chain-of-Thought Reasoning with End-to-End Driving

Add code
Bookmark button
Alert button
Mar 25, 2024
Tianqi Wang, Enze Xie, Ruihang Chu, Zhenguo Li, Ping Luo

Viaarxiv icon

Editing Massive Concepts in Text-to-Image Diffusion Models

Add code
Bookmark button
Alert button
Mar 20, 2024
Tianwei Xiong, Yue Wu, Enze Xie, Yue Wu, Zhenguo Li, Xihui Liu

Figure 1 for Editing Massive Concepts in Text-to-Image Diffusion Models
Figure 2 for Editing Massive Concepts in Text-to-Image Diffusion Models
Figure 3 for Editing Massive Concepts in Text-to-Image Diffusion Models
Figure 4 for Editing Massive Concepts in Text-to-Image Diffusion Models
Viaarxiv icon

TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model

Add code
Bookmark button
Alert button
Mar 15, 2024
Jiahao Lyu, Jin Wei, Gangyan Zeng, Zeng Li, Enze Xie, Wei Wang, Yu Zhou

Figure 1 for TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Figure 2 for TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Figure 3 for TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Figure 4 for TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Viaarxiv icon

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Add code
Bookmark button
Alert button
Mar 07, 2024
Junsong Chen, Chongjian Ge, Enze Xie, Yue Wu, Lewei Yao, Xiaozhe Ren, Zhongdao Wang, Ping Luo, Huchuan Lu, Zhenguo Li

Figure 1 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 2 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 3 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 4 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Viaarxiv icon

Accelerating Diffusion Sampling with Optimized Time Steps

Add code
Bookmark button
Alert button
Feb 27, 2024
Shuchen Xue, Zhaoqiang Liu, Fei Chen, Shifeng Zhang, Tianyang Hu, Enze Xie, Zhenguo Li

Viaarxiv icon

On the Expressive Power of a Variant of the Looped Transformer

Add code
Bookmark button
Alert button
Feb 21, 2024
Yihang Gao, Chuanyang Zheng, Enze Xie, Han Shi, Tianyang Hu, Yu Li, Michael K. Ng, Zhenguo Li, Zhaoqiang Liu

Viaarxiv icon

Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation

Add code
Bookmark button
Alert button
Jan 30, 2024
Zhenyu Wang, Enze Xie, Aoxue Li, Zhongdao Wang, Xihui Liu, Zhenguo Li

Viaarxiv icon

CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects

Add code
Bookmark button
Alert button
Jan 18, 2024
Zhao Wang, Aoxue Li, Enze Xie, Lingting Zhu, Yong Guo, Qi Dou, Zhenguo Li

Viaarxiv icon

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

Add code
Bookmark button
Alert button
Jan 10, 2024
Junsong Chen, Yue Wu, Simian Luo, Enze Xie, Sayak Paul, Ping Luo, Hang Zhao, Zhenguo Li

Viaarxiv icon

A Survey of Reasoning with Foundation Models

Add code
Bookmark button
Alert button
Dec 26, 2023
Jiankai Sun, Chuanyang Zheng, Enze Xie, Zhengying Liu, Ruihang Chu, Jianing Qiu, Jiaqi Xu, Mingyu Ding, Hongyang Li, Mengzhe Geng, Yue Wu, Wenhai Wang, Junsong Chen, Zhangyue Yin, Xiaozhe Ren, Jie Fu, Junxian He, Wu Yuan, Qi Liu, Xihui Liu, Yu Li, Hao Dong, Yu Cheng, Ming Zhang, Pheng Ann Heng, Jifeng Dai, Ping Luo, Jingdong Wang, Ji-Rong Wen, Xipeng Qiu, Yike Guo, Hui Xiong, Qun Liu, Zhenguo Li

Viaarxiv icon