Pichao Wang

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

Mar 22, 2023

Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm

Mar 14, 2023

Head-Free Lightweight Semantic Segmentation with Linear Transformer

Jan 11, 2023

A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition

Nov 16, 2022

Focal and Global Spatial-Temporal Transformer for Skeleton-based Action Recognition

Oct 06, 2022

Effective Vision Transformer Training: A Data-Centric Perspective

Sep 29, 2022

FT-HID: A Large Scale RGB-D Dataset for First and Third Person Human Interaction Analysis

Sep 21, 2022

BP-Triplet Net for Unsupervised Domain Adaptation: A Bayesian Perspective

Feb 19, 2022

Image-to-Video Re-Identification via Mutual Discriminative Knowledge Transfer

Jan 21, 2022

ELSA: Enhanced Local Self-Attention for Vision Transformer

Dec 23, 2021