Deli Zhao

Space Group Constrained Crystal Generation

Feb 06, 2024
Rui Jiao, Wenbing Huang, Yu Liu, Deli Zhao, Yang Liu

Latent Space Editing in Transformer-Based Flow Matching

Dec 17, 2023
Vincent Tao Hu, David W Zhang, Pascal Mettes, Meng Tang, Deli Zhao, Cees G. M. Snoek

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Nov 07, 2023
Shiwei Zhang, Jiayu Wang, Yingya Zhang, Kang Zhao, Hangjie Yuan, Zhiwu Qin, Xiang Wang, Deli Zhao, Jingren Zhou

Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone

Oct 30, 2023
Zeyinzi Jiang, Chaojie Mao, Ziyuan Huang, Ao Ma, Yiliang Lv, Yujun Shen, Deli Zhao, Jingren Zhou

Few-shot Action Recognition with Captioning Foundation Models

Oct 16, 2023
Xiang Wang, Shiwei Zhang, Hangjie Yuan, Yingya Zhang, Changxin Gao, Deli Zhao, Nong Sang

Towards More Accurate Diffusion Model Acceleration with A Timestep Aligner

Oct 14, 2023
Mengfei Xia, Yujun Shen, Changsong Lei, Yu Zhou, Ran Yi, Deli Zhao, Wenping Wang, Yong-jin Liu

Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers

Oct 09, 2023
Shiyue Cao, Yueqin Yin, Lianghua Huang, Yu Liu, Xin Zhao, Deli Zhao, Kaiqi Huang

In-Domain GAN Inversion for Faithful Reconstruction and Editability

Sep 25, 2023
Jiapeng Zhu, Yujun Shen, Yinghao Xu, Deli Zhao, Qifeng Chen, Bolei Zhou

Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning

Sep 14, 2023
Zhiwu Qing, Shiwei Zhang, Ziyuan Huang, Yingya Zhang, Changxin Gao, Deli Zhao, Nong Sang

RLIPv2: Fast Scaling of Relational Language-Image Pre-training

Aug 18, 2023
Hangjie Yuan, Shiwei Zhang, Xiang Wang, Samuel Albanie, Yining Pan, Tao Feng, Jianwen Jiang, Dong Ni, Yingya Zhang, Deli Zhao
