Alert button
Picture for Yu Qiao

Yu Qiao

Alert button

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

May 11, 2023
Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao

Figure 1 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 2 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 3 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 4 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Viaarxiv icon

VideoChat: Chat-Centric Video Understanding

May 10, 2023
KunChang Li, Yinan He, Yi Wang, Yizhuo Li, Wenhai Wang, Ping Luo, Yali Wang, Limin Wang, Yu Qiao

Figure 1 for VideoChat: Chat-Centric Video Understanding
Figure 2 for VideoChat: Chat-Centric Video Understanding
Figure 3 for VideoChat: Chat-Centric Video Understanding
Figure 4 for VideoChat: Chat-Centric Video Understanding
Viaarxiv icon

Causal Discovery with Unobserved Variables: A Proxy Variable Approach

May 09, 2023
Mingzhou Liu, Xinwei Sun, Yu Qiao, Yizhou Wang

Figure 1 for Causal Discovery with Unobserved Variables: A Proxy Variable Approach
Figure 2 for Causal Discovery with Unobserved Variables: A Proxy Variable Approach
Figure 3 for Causal Discovery with Unobserved Variables: A Proxy Variable Approach
Figure 4 for Causal Discovery with Unobserved Variables: A Proxy Variable Approach
Viaarxiv icon

LEO: Generative Latent Image Animator for Human Video Synthesis

May 06, 2023
Yaohui Wang, Xin Ma, Xinyuan Chen, Antitza Dantcheva, Bo Dai, Yu Qiao

Figure 1 for LEO: Generative Latent Image Animator for Human Video Synthesis
Figure 2 for LEO: Generative Latent Image Animator for Human Video Synthesis
Figure 3 for LEO: Generative Latent Image Animator for Human Video Synthesis
Figure 4 for LEO: Generative Latent Image Animator for Human Video Synthesis
Viaarxiv icon

Long-Term Rhythmic Video Soundtracker

May 02, 2023
Jiashuo Yu, Yaohui Wang, Xinyuan Chen, Xiao Sun, Yu Qiao

Figure 1 for Long-Term Rhythmic Video Soundtracker
Figure 2 for Long-Term Rhythmic Video Soundtracker
Figure 3 for Long-Term Rhythmic Video Soundtracker
Figure 4 for Long-Term Rhythmic Video Soundtracker
Viaarxiv icon

Segment Anything Model (SAM) Meets Glass: Mirror and Transparent Objects Cannot Be Easily Detected

Apr 29, 2023
Dongsheng Han, Chaoning Zhang, Yu Qiao, Maryam Qamar, Yuna Jung, SeungKyu Lee, Sung-Ho Bae, Choong Seon Hong

Figure 1 for Segment Anything Model (SAM) Meets Glass: Mirror and Transparent Objects Cannot Be Easily Detected
Figure 2 for Segment Anything Model (SAM) Meets Glass: Mirror and Transparent Objects Cannot Be Easily Detected
Figure 3 for Segment Anything Model (SAM) Meets Glass: Mirror and Transparent Objects Cannot Be Easily Detected
Figure 4 for Segment Anything Model (SAM) Meets Glass: Mirror and Transparent Objects Cannot Be Easily Detected
Viaarxiv icon

LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model

Apr 28, 2023
Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao

Figure 1 for LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
Figure 2 for LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
Figure 3 for LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
Figure 4 for LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
Viaarxiv icon

Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation

Apr 25, 2023
Zeyu Lu, Chengyue Wu, Xinyuan Chen, Yaohui Wang, Lei Bai, Yu Qiao, Xihui Liu

Figure 1 for Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation
Figure 2 for Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation
Figure 3 for Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation
Figure 4 for Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation
Viaarxiv icon

Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving

Apr 20, 2023
Huijie Wang, Zhenbo Liu, Yang Li, Tianyu Li, Li Chen, Chonghao Sima, Yuting Wang, Shengyin Jiang, Feng Wen, Hang Xu, Ping Luo, Junchi Yan, Wei Zhang, Jun Yao, Yu Qiao, Hongyang Li

Figure 1 for Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving
Figure 2 for Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving
Figure 3 for Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving
Figure 4 for Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving
Viaarxiv icon

Perception Imitation: Towards Synthesis-free Simulator for Autonomous Vehicles

Apr 19, 2023
Xiaoliang Ju, Yiyang Sun, Yiming Hao, Yikang Li, Yu Qiao, Hongsheng Li

Figure 1 for Perception Imitation: Towards Synthesis-free Simulator for Autonomous Vehicles
Figure 2 for Perception Imitation: Towards Synthesis-free Simulator for Autonomous Vehicles
Figure 3 for Perception Imitation: Towards Synthesis-free Simulator for Autonomous Vehicles
Figure 4 for Perception Imitation: Towards Synthesis-free Simulator for Autonomous Vehicles
Viaarxiv icon