Guanglu Song

Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance

Apr 08, 2024
Dazhong Shen, Guanglu Song, Zeyue Xue, Fu-Yun Wang, Yu Liu

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Apr 04, 2024
Dongzhi Jiang, Guanglu Song, Xiaoshi Wu, Renrui Zhang, Dazhong Shen, Zhuofan Zong, Yu Liu, Hongsheng Li

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models

Mar 25, 2024
Hao Shao, Shengju Qian, Han Xiao, Guanglu Song, Zhuofan Zong, Letian Wang, Yu Liu, Hongsheng Li

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

Mar 20, 2024
Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang, Xiaoyu Shi, Dazhong Shen, Guanglu Song, Yu Liu, Hongsheng Li

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis

Mar 19, 2024
Linjiang Huang, Rongyao Fang, Aiping Zhang, Guanglu Song, Si Liu, Yu Liu, Hongsheng Li

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

Feb 01, 2024
Fu-Yun Wang, Zhaoyang Huang, Xiaoyu Shi, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li

Towards Large-scale Masked Face Recognition

Oct 25, 2023
Manyuan Zhang, Bingqi Ma, Guanglu Song, Yunxiao Wang, Hongsheng Li, Yu Liu

Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection

Oct 24, 2023
Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths

May 29, 2023
Zeyue Xue, Guanglu Song, Qiushan Guo, Boxiao Liu, Zhuofan Zong, Yu Liu, Ping Luo

Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising

May 29, 2023
Fu-Yun Wang, Wenshuo Chen, Guanglu Song, Han-Jia Ye, Yu Liu, Hongsheng Li
