Alert button
Picture for Licheng Yu

Licheng Yu

Alert button

Animated Stickers: Bringing Stickers to Life with Video Diffusion

Add code
Bookmark button
Alert button
Feb 08, 2024
David Yan, Winnie Zhang, Luxin Zhang, Anmol Kalia, Dingkang Wang, Ankit Ramchandani, Miao Liu, Albert Pumarola, Edgar Schoenfeld, Elliot Blanchard, Krishna Narni, Yaqiao Luo, Lawrence Chen, Guan Pang, Ali Thabet, Peter Vajda, Amy Bearman, Licheng Yu

Viaarxiv icon

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis

Add code
Bookmark button
Alert button
Dec 29, 2023
Feng Liang, Bichen Wu, Jialiang Wang, Licheng Yu, Kunpeng Li, Yinan Zhao, Ishan Misra, Jia-Bin Huang, Peizhao Zhang, Peter Vajda, Diana Marculescu

Viaarxiv icon

Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis

Add code
Bookmark button
Alert button
Dec 20, 2023
Bichen Wu, Ching-Yao Chuang, Xiaoyan Wang, Yichen Jia, Kapil Krishnakumar, Tong Xiao, Feng Liang, Licheng Yu, Peter Vajda

Viaarxiv icon

AVID: Any-Length Video Inpainting with Diffusion Model

Add code
Bookmark button
Alert button
Dec 06, 2023
Zhixing Zhang, Bichen Wu, Xiaoyan Wang, Yaqiao Luo, Luxin Zhang, Yinan Zhao, Peter Vajda, Dimitris Metaxas, Licheng Yu

Figure 1 for AVID: Any-Length Video Inpainting with Diffusion Model
Figure 2 for AVID: Any-Length Video Inpainting with Diffusion Model
Figure 3 for AVID: Any-Length Video Inpainting with Diffusion Model
Figure 4 for AVID: Any-Length Video Inpainting with Diffusion Model
Viaarxiv icon

VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Add code
Bookmark button
Alert button
Dec 05, 2023
Yuchao Gu, Yipin Zhou, Bichen Wu, Licheng Yu, Jia-Wei Liu, Rui Zhao, Jay Zhangjie Wu, David Junhao Zhang, Mike Zheng Shou, Kevin Tang

Viaarxiv icon

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

Add code
Bookmark button
Alert button
Nov 17, 2023
Animesh Sinha, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy Bearman, Dhruv Mahajan

Viaarxiv icon

AMELI: Enhancing Multimodal Entity Linking with Fine-Grained Attributes

Add code
Bookmark button
Alert button
May 24, 2023
Barry Menglong Yao, Yu Chen, Qifan Wang, Sijia Wang, Minqian Liu, Zhiyang Xu, Licheng Yu, Lifu Huang

Figure 1 for AMELI: Enhancing Multimodal Entity Linking with Fine-Grained Attributes
Figure 2 for AMELI: Enhancing Multimodal Entity Linking with Fine-Grained Attributes
Figure 3 for AMELI: Enhancing Multimodal Entity Linking with Fine-Grained Attributes
Figure 4 for AMELI: Enhancing Multimodal Entity Linking with Fine-Grained Attributes
Viaarxiv icon

Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations

Add code
Bookmark button
Alert button
Mar 31, 2023
Yiwu Zhong, Licheng Yu, Yang Bai, Shangwen Li, Xueting Yan, Yin Li

Figure 1 for Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Figure 2 for Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Figure 3 for Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Figure 4 for Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Viaarxiv icon

Learning and Verification of Task Structure in Instructional Videos

Add code
Bookmark button
Alert button
Mar 23, 2023
Medhini Narasimhan, Licheng Yu, Sean Bell, Ning Zhang, Trevor Darrell

Figure 1 for Learning and Verification of Task Structure in Instructional Videos
Figure 2 for Learning and Verification of Task Structure in Instructional Videos
Figure 3 for Learning and Verification of Task Structure in Instructional Videos
Figure 4 for Learning and Verification of Task Structure in Instructional Videos
Viaarxiv icon

FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks

Add code
Bookmark button
Alert button
Mar 04, 2023
Xiao Han, Xiatian Zhu, Licheng Yu, Li Zhang, Yi-Zhe Song, Tao Xiang

Figure 1 for FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks
Figure 2 for FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks
Figure 3 for FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks
Figure 4 for FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks
Viaarxiv icon