Alert button
Picture for Limin Wang

Limin Wang

Alert button

AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation

Add code
Bookmark button
Alert button
May 30, 2023
Chuhao Jin, Wenhui Tan, Jiange Yang, Bei Liu, Ruihua Song, Limin Wang, Jianlong Fu

Figure 1 for AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Figure 2 for AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Figure 3 for AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Figure 4 for AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Viaarxiv icon

MixFormerV2: Efficient Fully Transformer Tracking

Add code
Bookmark button
Alert button
May 25, 2023
Yutao Cui, Tianhui Song, Gangshan Wu, Limin Wang

Figure 1 for MixFormerV2: Efficient Fully Transformer Tracking
Figure 2 for MixFormerV2: Efficient Fully Transformer Tracking
Figure 3 for MixFormerV2: Efficient Fully Transformer Tracking
Figure 4 for MixFormerV2: Efficient Fully Transformer Tracking
Viaarxiv icon

VideoLLM: Modeling Video Sequence with Large Language Models

Add code
Bookmark button
Alert button
May 23, 2023
Guo Chen, Yin-Dong Zheng, Jiahao Wang, Jilan Xu, Yifei Huang, Junting Pan, Yi Wang, Yali Wang, Yu Qiao, Tong Lu, Limin Wang

Figure 1 for VideoLLM: Modeling Video Sequence with Large Language Models
Figure 2 for VideoLLM: Modeling Video Sequence with Large Language Models
Figure 3 for VideoLLM: Modeling Video Sequence with Large Language Models
Figure 4 for VideoLLM: Modeling Video Sequence with Large Language Models
Viaarxiv icon

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

Add code
Bookmark button
Alert button
May 11, 2023
Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao

Figure 1 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 2 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 3 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 4 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Viaarxiv icon

VideoChat: Chat-Centric Video Understanding

Add code
Bookmark button
Alert button
May 10, 2023
KunChang Li, Yinan He, Yi Wang, Yizhuo Li, Wenhai Wang, Ping Luo, Yali Wang, Limin Wang, Yu Qiao

Figure 1 for VideoChat: Chat-Centric Video Understanding
Figure 2 for VideoChat: Chat-Centric Video Understanding
Figure 3 for VideoChat: Chat-Centric Video Understanding
Figure 4 for VideoChat: Chat-Centric Video Understanding
Viaarxiv icon

An Evidential Real-Time Multi-Mode Fault Diagnosis Approach Based on Broad Learning System

Add code
Bookmark button
Alert button
Apr 29, 2023
Chen Li, Zeyi Liu, Limin Wang, Minyue Li, Xiao He

Figure 1 for An Evidential Real-Time Multi-Mode Fault Diagnosis Approach Based on Broad Learning System
Figure 2 for An Evidential Real-Time Multi-Mode Fault Diagnosis Approach Based on Broad Learning System
Figure 3 for An Evidential Real-Time Multi-Mode Fault Diagnosis Approach Based on Broad Learning System
Figure 4 for An Evidential Real-Time Multi-Mode Fault Diagnosis Approach Based on Broad Learning System
Viaarxiv icon

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Add code
Bookmark button
Alert button
Apr 18, 2023
Limin Wang, Bingkun Huang, Zhiyu Zhao, Zhan Tong, Yinan He, Yi Wang, Yali Wang, Yu Qiao

Figure 1 for VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Figure 2 for VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Figure 3 for VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Figure 4 for VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Viaarxiv icon

Efficient Video Action Detection with Token Dropout and Context Refinement

Add code
Bookmark button
Alert button
Apr 17, 2023
Lei Chen, Zhan Tong, Yibing Song, Gangshan Wu, Limin Wang

Figure 1 for Efficient Video Action Detection with Token Dropout and Context Refinement
Figure 2 for Efficient Video Action Detection with Token Dropout and Context Refinement
Figure 3 for Efficient Video Action Detection with Token Dropout and Context Refinement
Figure 4 for Efficient Video Action Detection with Token Dropout and Context Refinement
Viaarxiv icon

Progressive Visual Prompt Learning with Contrastive Feature Re-formation

Add code
Bookmark button
Alert button
Apr 17, 2023
Chen Xu, Haocheng Shen, Fengyuan Shi, Boheng Chen, Yixuan Liao, Xiaoxin Chen, Limin Wang

Figure 1 for Progressive Visual Prompt Learning with Contrastive Feature Re-formation
Figure 2 for Progressive Visual Prompt Learning with Contrastive Feature Re-formation
Figure 3 for Progressive Visual Prompt Learning with Contrastive Feature Re-formation
Figure 4 for Progressive Visual Prompt Learning with Contrastive Feature Re-formation
Viaarxiv icon

SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes

Add code
Bookmark button
Alert button
Apr 13, 2023
Yutao Cui, Chenkai Zeng, Xiaoyu Zhao, Yichun Yang, Gangshan Wu, Limin Wang

Figure 1 for SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes
Figure 2 for SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes
Figure 3 for SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes
Figure 4 for SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes
Viaarxiv icon