Alert button
Picture for Zuxuan Wu

Zuxuan Wu

Alert button

To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Add code
Bookmark button
Alert button
Nov 29, 2023
Junke Wang, Lingchen Meng, Zejia Weng, Bo He, Zuxuan Wu, Yu-Gang Jiang

Viaarxiv icon

AdaDiff: Adaptive Step Selection for Fast Diffusion

Add code
Bookmark button
Alert button
Nov 24, 2023
Hui Zhang, Zuxuan Wu, Zhen Xing, Jie Shao, Yu-Gang Jiang

Viaarxiv icon

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Add code
Bookmark button
Alert button
Nov 24, 2023
Lingchen Meng, Shiyi Lan, Hengduo Li, Jose M. Alvarez, Zuxuan Wu, Yu-Gang Jiang

Viaarxiv icon

Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models

Add code
Bookmark button
Alert button
Oct 25, 2023
Tianyi Lu, Xing Zhang, Jiaxi Gu, Hang Xu, Renjing Pei, Songcen Xu, Zuxuan Wu

Viaarxiv icon

Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection

Add code
Bookmark button
Alert button
Oct 18, 2023
Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang

Figure 1 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Figure 2 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Figure 3 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Figure 4 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Viaarxiv icon

A Survey on Video Diffusion Models

Add code
Bookmark button
Alert button
Oct 16, 2023
Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang

Viaarxiv icon

Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data

Add code
Bookmark button
Alert button
Oct 08, 2023
Zuxuan Wu, Zejia Weng, Wujian Peng, Xitong Yang, Ang Li, Larry S. Davis, Yu-Gang Jiang

Figure 1 for Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data
Figure 2 for Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data
Figure 3 for Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data
Figure 4 for Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data
Viaarxiv icon

Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation

Add code
Bookmark button
Alert button
Sep 07, 2023
Jiaxi Gu, Shicong Wang, Haoyu Zhao, Tianyi Lu, Xing Zhang, Zuxuan Wu, Songcen Xu, Wei Zhang, Yu-Gang Jiang, Hang Xu

Figure 1 for Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Figure 2 for Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Figure 3 for Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Figure 4 for Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Viaarxiv icon

SimDA: Simple Diffusion Adapter for Efficient Video Generation

Add code
Bookmark button
Alert button
Aug 18, 2023
Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang

Figure 1 for SimDA: Simple Diffusion Adapter for Efficient Video Generation
Figure 2 for SimDA: Simple Diffusion Adapter for Efficient Video Generation
Figure 3 for SimDA: Simple Diffusion Adapter for Efficient Video Generation
Figure 4 for SimDA: Simple Diffusion Adapter for Efficient Video Generation
Viaarxiv icon

On the Importance of Spatial Relations for Few-shot Action Recognition

Add code
Bookmark button
Alert button
Aug 14, 2023
Yilun Zhang, Yuqian Fu, Xingjun Ma, Lizhe Qi, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

Figure 1 for On the Importance of Spatial Relations for Few-shot Action Recognition
Figure 2 for On the Importance of Spatial Relations for Few-shot Action Recognition
Figure 3 for On the Importance of Spatial Relations for Few-shot Action Recognition
Figure 4 for On the Importance of Spatial Relations for Few-shot Action Recognition
Viaarxiv icon