Alert button
Picture for Yu Qiao

Yu Qiao

Alert button

VideoBooth: Diffusion-based Video Generation with Image Prompts

Add code
Bookmark button
Alert button
Dec 01, 2023
Yuming Jiang, Tianxing Wu, Shuai Yang, Chenyang Si, Dahua Lin, Yu Qiao, Chen Change Loy, Ziwei Liu

Figure 1 for VideoBooth: Diffusion-based Video Generation with Image Prompts
Figure 2 for VideoBooth: Diffusion-based Video Generation with Image Prompts
Figure 3 for VideoBooth: Diffusion-based Video Generation with Image Prompts
Figure 4 for VideoBooth: Diffusion-based Video Generation with Image Prompts
Viaarxiv icon

MLLMs-Augmented Visual-Language Representation Learning

Add code
Bookmark button
Alert button
Dec 01, 2023
Yanqing Liu, Kai Wang, Wenqi Shao, Ping Luo, Yu Qiao, Mike Zheng Shou, Kaipeng Zhang, Yang You

Figure 1 for MLLMs-Augmented Visual-Language Representation Learning
Figure 2 for MLLMs-Augmented Visual-Language Representation Learning
Figure 3 for MLLMs-Augmented Visual-Language Representation Learning
Figure 4 for MLLMs-Augmented Visual-Language Representation Learning
Viaarxiv icon

VBench: Comprehensive Benchmark Suite for Video Generative Models

Add code
Bookmark button
Alert button
Nov 29, 2023
Ziqi Huang, Yinan He, Jiashuo Yu, Fan Zhang, Chenyang Si, Yuming Jiang, Yuanhan Zhang, Tianxing Wu, Qingyang Jin, Nattapol Chanpaisit, Yaohui Wang, Xinyuan Chen, Limin Wang, Dahua Lin, Yu Qiao, Ziwei Liu

Viaarxiv icon

Query-Relevant Images Jailbreak Large Multi-Modal Models

Add code
Bookmark button
Alert button
Nov 29, 2023
Xin Liu, Yichen Zhu, Yunshi Lan, Chao Yang, Yu Qiao

Viaarxiv icon

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark

Add code
Bookmark button
Alert button
Nov 28, 2023
Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao

Figure 1 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Figure 2 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Figure 3 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Figure 4 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Viaarxiv icon

On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving

Add code
Bookmark button
Alert button
Nov 28, 2023
Licheng Wen, Xuemeng Yang, Daocheng Fu, Xiaofeng Wang, Pinlong Cai, Xin Li, Tao Ma, Yingxuan Li, Linran Xu, Dengke Shang, Zheng Zhu, Shaoyan Sun, Yeqi Bai, Xinyu Cai, Min Dou, Shuanglu Hu, Botian Shi, Yu Qiao

Figure 1 for On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving
Figure 2 for On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving
Figure 3 for On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving
Figure 4 for On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving
Viaarxiv icon

SinSR: Diffusion-Based Image Super-Resolution in a Single Step

Add code
Bookmark button
Alert button
Nov 23, 2023
Yufei Wang, Wenhan Yang, Xinyuan Chen, Yaohui Wang, Lanqing Guo, Lap-Pui Chau, Ziwei Liu, Yu Qiao, Alex C. Kot, Bihan Wen

Figure 1 for SinSR: Diffusion-Based Image Super-Resolution in a Single Step
Viaarxiv icon

Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision

Add code
Bookmark button
Alert button
Nov 23, 2023
Yu Yi, Xue Yang, Qingyun Li, Feipeng Da, Junchi Yan, Jifeng Dai, Yu Qiao

Viaarxiv icon

DiffusionMat: Alpha Matting as Sequential Refinement Learning

Add code
Bookmark button
Alert button
Nov 22, 2023
Yangyang Xu, Shengfeng He, Wenqi Shao, Kwan-Yee K. Wong, Yu Qiao, Ping Luo

Viaarxiv icon