Alert button
Picture for Yu Qiao

Yu Qiao

Alert button

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

Add code
Bookmark button
Alert button
Jul 13, 2023
Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao

Figure 1 for InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Figure 2 for InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Figure 3 for InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Figure 4 for InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Viaarxiv icon

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Add code
Bookmark button
Alert button
Jul 10, 2023
Yuwei Guo, Ceyuan Yang, Anyi Rao, Yaohui Wang, Yu Qiao, Dahua Lin, Bo Dai

Figure 1 for AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Figure 2 for AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Figure 3 for AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Figure 4 for AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Viaarxiv icon

JourneyDB: A Benchmark for Generative Image Understanding

Add code
Bookmark button
Alert button
Jul 03, 2023
Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li

Figure 1 for JourneyDB: A Benchmark for Generative Image Understanding
Figure 2 for JourneyDB: A Benchmark for Generative Image Understanding
Figure 3 for JourneyDB: A Benchmark for Generative Image Understanding
Figure 4 for JourneyDB: A Benchmark for Generative Image Understanding
Viaarxiv icon

Faster Segment Anything: Towards Lightweight SAM for Mobile Applications

Add code
Bookmark button
Alert button
Jul 01, 2023
Chaoning Zhang, Dongshen Han, Yu Qiao, Jung Uk Kim, Sung-Ho Bae, Seungkyu Lee, Choong Seon Hong

Figure 1 for Faster Segment Anything: Towards Lightweight SAM for Mobile Applications
Figure 2 for Faster Segment Anything: Towards Lightweight SAM for Mobile Applications
Figure 3 for Faster Segment Anything: Towards Lightweight SAM for Mobile Applications
Figure 4 for Faster Segment Anything: Towards Lightweight SAM for Mobile Applications
Viaarxiv icon

Denoising Diffusion Semantic Segmentation with Mask Prior Modeling

Add code
Bookmark button
Alert button
Jun 22, 2023
Zeqiang Lai, Yuchen Duan, Jifeng Dai, Ziheng Li, Ying Fu, Hongsheng Li, Yu Qiao, Wenhai Wang

Figure 1 for Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Figure 2 for Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Figure 3 for Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Figure 4 for Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Viaarxiv icon

Align, Adapt and Inject: Sound-guided Unified Image Generation

Add code
Bookmark button
Alert button
Jun 20, 2023
Yue Yang, Kaipeng Zhang, Yuying Ge, Wenqi Shao, Zeyue Xue, Yu Qiao, Ping Luo

Figure 1 for Align, Adapt and Inject: Sound-guided Unified Image Generation
Figure 2 for Align, Adapt and Inject: Sound-guided Unified Image Generation
Figure 3 for Align, Adapt and Inject: Sound-guided Unified Image Generation
Figure 4 for Align, Adapt and Inject: Sound-guided Unified Image Generation
Viaarxiv icon

MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification

Add code
Bookmark button
Alert button
Jun 16, 2023
Dequan Wang, Xiaosong Wang, Lilong Wang, Mengzhang Li, Qian Da, Xiaoqiang Liu, Xiangyu Gao, Jun Shen, Junjun He, Tian Shen, Qi Duan, Jie Zhao, Kang Li, Yu Qiao, Shaoting Zhang

Figure 1 for MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification
Figure 2 for MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification
Figure 3 for MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification
Figure 4 for MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification
Viaarxiv icon

Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models

Add code
Bookmark button
Alert button
Jun 15, 2023
Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li

Figure 1 for Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
Figure 2 for Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
Figure 3 for Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
Figure 4 for Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
Viaarxiv icon

LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models

Add code
Bookmark button
Alert button
Jun 15, 2023
Peng Xu, Wenqi Shao, Kaipeng Zhang, Peng Gao, Shuo Liu, Meng Lei, Fanqing Meng, Siyuan Huang, Yu Qiao, Ping Luo

Figure 1 for LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Figure 2 for LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Figure 3 for LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Figure 4 for LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Viaarxiv icon