Alert button
Picture for Ping Luo

Ping Luo

Alert button

VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix

Add code
Bookmark button
Alert button
Jun 17, 2022
Teng Wang, Wenhao Jiang, Zhichao Lu, Feng Zheng, Ran Cheng, Chengguo Yin, Ping Luo

Figure 1 for VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix
Figure 2 for VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix
Figure 3 for VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix
Figure 4 for VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix
Viaarxiv icon

CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer

Add code
Bookmark button
Alert button
Jun 17, 2022
Yao Mu, Shoufa Chen, Mingyu Ding, Jianyu Chen, Runjian Chen, Ping Luo

Figure 1 for CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Figure 2 for CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Figure 3 for CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Figure 4 for CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Viaarxiv icon

AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation

Add code
Bookmark button
Alert button
Jun 16, 2022
Yuanfeng Ji, Haotian Bai, Jie Yang, Chongjian Ge, Ye Zhu, Ruimao Zhang, Zhen Li, Lingyan Zhang, Wanling Ma, Xiang Wan, Ping Luo

Figure 1 for AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation
Figure 2 for AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation
Figure 3 for AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation
Figure 4 for AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation
Viaarxiv icon

CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving

Add code
Bookmark button
Alert button
Jun 08, 2022
Runjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Zhenguo Li, Ping Luo

Figure 1 for CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving
Figure 2 for CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving
Figure 3 for CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving
Figure 4 for CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving
Viaarxiv icon

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition

Add code
Bookmark button
Alert button
May 26, 2022
Shoufa Chen, Chongjian Ge, Zhan Tong, Jiangliu Wang, Yibing Song, Jue Wang, Ping Luo

Figure 1 for AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Figure 2 for AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Figure 3 for AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Figure 4 for AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Viaarxiv icon

Flow-based Recurrent Belief State Learning for POMDPs

Add code
Bookmark button
Alert button
May 23, 2022
Xiaoyu Chen, Yao Mu, Ping Luo, Shengbo Li, Jianyu Chen

Figure 1 for Flow-based Recurrent Belief State Learning for POMDPs
Figure 2 for Flow-based Recurrent Belief State Learning for POMDPs
Figure 3 for Flow-based Recurrent Belief State Learning for POMDPs
Figure 4 for Flow-based Recurrent Belief State Learning for POMDPs
Viaarxiv icon

An Empirical Investigation of Representation Learning for Imitation

Add code
Bookmark button
Alert button
May 16, 2022
Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah

Figure 1 for An Empirical Investigation of Representation Learning for Imitation
Figure 2 for An Empirical Investigation of Representation Learning for Imitation
Figure 3 for An Empirical Investigation of Representation Learning for Imitation
Figure 4 for An Empirical Investigation of Representation Learning for Imitation
Viaarxiv icon

MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval

Add code
Bookmark button
Alert button
Apr 26, 2022
Yuying Ge, Yixiao Ge, Xihui Liu, Alex Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo

Figure 1 for MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Figure 2 for MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Figure 3 for MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Figure 4 for MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Viaarxiv icon

Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer

Add code
Bookmark button
Alert button
Apr 21, 2022
Wang Zeng, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang

Figure 1 for Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
Figure 2 for Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
Figure 3 for Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
Figure 4 for Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
Viaarxiv icon

M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation

Add code
Bookmark button
Alert button
Apr 19, 2022
Enze Xie, Zhiding Yu, Daquan Zhou, Jonah Philion, Anima Anandkumar, Sanja Fidler, Ping Luo, Jose M. Alvarez

Figure 1 for M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation
Figure 2 for M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation
Figure 3 for M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation
Figure 4 for M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation
Viaarxiv icon