Alert button
Picture for Yu Qiao

Yu Qiao

Alert button

Networks are Slacking Off: Understanding Generalization Problem in Image Deraining

Add code
Bookmark button
Alert button
May 24, 2023
Jinjin Gu, Xianzheng Ma, Xiangtao Kong, Yu Qiao, Chao Dong

Figure 1 for Networks are Slacking Off: Understanding Generalization Problem in Image Deraining
Figure 2 for Networks are Slacking Off: Understanding Generalization Problem in Image Deraining
Figure 3 for Networks are Slacking Off: Understanding Generalization Problem in Image Deraining
Figure 4 for Networks are Slacking Off: Understanding Generalization Problem in Image Deraining
Viaarxiv icon

EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought

Add code
Bookmark button
Alert button
May 24, 2023
Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo

Figure 1 for EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought
Figure 2 for EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought
Figure 3 for EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought
Figure 4 for EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought
Viaarxiv icon

Causal Discovery with Unobserved Variables: A Proxy Variable Approach

Add code
Bookmark button
Alert button
May 24, 2023
Mingzhou Liu, Xinwei Sun, Yu Qiao, Yizhou Wang

Figure 1 for Causal Discovery with Unobserved Variables: A Proxy Variable Approach
Figure 2 for Causal Discovery with Unobserved Variables: A Proxy Variable Approach
Figure 3 for Causal Discovery with Unobserved Variables: A Proxy Variable Approach
Figure 4 for Causal Discovery with Unobserved Variables: A Proxy Variable Approach
Viaarxiv icon

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

Add code
Bookmark button
Alert button
May 24, 2023
Siyuan Huang, Zhengkai Jiang, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li

Figure 1 for Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Figure 2 for Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Figure 3 for Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Figure 4 for Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Viaarxiv icon

VideoLLM: Modeling Video Sequence with Large Language Models

Add code
Bookmark button
Alert button
May 23, 2023
Guo Chen, Yin-Dong Zheng, Jiahao Wang, Jilan Xu, Yifei Huang, Junting Pan, Yi Wang, Yali Wang, Yu Qiao, Tong Lu, Limin Wang

Figure 1 for VideoLLM: Modeling Video Sequence with Large Language Models
Figure 2 for VideoLLM: Modeling Video Sequence with Large Language Models
Figure 3 for VideoLLM: Modeling Video Sequence with Large Language Models
Figure 4 for VideoLLM: Modeling Video Sequence with Large Language Models
Viaarxiv icon

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

Add code
Bookmark button
Alert button
May 18, 2023
Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai

Figure 1 for VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Figure 2 for VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Figure 3 for VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Figure 4 for VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Viaarxiv icon

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

Add code
Bookmark button
Alert button
May 11, 2023
Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao

Figure 1 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 2 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 3 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 4 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Viaarxiv icon

VideoChat: Chat-Centric Video Understanding

Add code
Bookmark button
Alert button
May 10, 2023
KunChang Li, Yinan He, Yi Wang, Yizhuo Li, Wenhai Wang, Ping Luo, Yali Wang, Limin Wang, Yu Qiao

Figure 1 for VideoChat: Chat-Centric Video Understanding
Figure 2 for VideoChat: Chat-Centric Video Understanding
Figure 3 for VideoChat: Chat-Centric Video Understanding
Figure 4 for VideoChat: Chat-Centric Video Understanding
Viaarxiv icon

LEO: Generative Latent Image Animator for Human Video Synthesis

Add code
Bookmark button
Alert button
May 06, 2023
Yaohui Wang, Xin Ma, Xinyuan Chen, Antitza Dantcheva, Bo Dai, Yu Qiao

Figure 1 for LEO: Generative Latent Image Animator for Human Video Synthesis
Figure 2 for LEO: Generative Latent Image Animator for Human Video Synthesis
Figure 3 for LEO: Generative Latent Image Animator for Human Video Synthesis
Figure 4 for LEO: Generative Latent Image Animator for Human Video Synthesis
Viaarxiv icon