Alert button
Picture for Pichao Wang

Pichao Wang

Alert button

Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval

Add code
Bookmark button
Alert button
Mar 26, 2024
Jiamian Wang, Guohao Sun, Pichao Wang, Dongfang Liu, Sohail Dianat, Majid Rabbani, Raghuveer Rao, Zhiqiang Tao

Viaarxiv icon

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation

Add code
Bookmark button
Alert button
Nov 20, 2023
Wenhao Li, Mengyuan Liu, Hong Liu, Pichao Wang, Jialun Cai, Nicu Sebe

Viaarxiv icon

Human Pose-based Estimation, Tracking and Action Recognition with Deep Learning: A Survey

Add code
Bookmark button
Alert button
Oct 19, 2023
Lijuan Zhou, Xiang Meng, Zhihuan Liu, Mengqi Wu, Zhimin Gao, Pichao Wang

Viaarxiv icon

SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels

Add code
Bookmark button
Alert button
Sep 18, 2023
Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou

Figure 1 for SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
Figure 2 for SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
Figure 3 for SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
Figure 4 for SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
Viaarxiv icon

Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition

Add code
Bookmark button
Alert button
Sep 11, 2023
Yujun Ma, Benjia Zhou, Ruili Wang, Pichao Wang

Figure 1 for Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition
Figure 2 for Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition
Figure 3 for Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition
Figure 4 for Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition
Viaarxiv icon

Revisiting Vision Transformer from the View of Path Ensemble

Add code
Bookmark button
Alert button
Aug 12, 2023
Shuning Chang, Pichao Wang, Hao Luo, Fan Wang, Mike Zheng Shou

Figure 1 for Revisiting Vision Transformer from the View of Path Ensemble
Figure 2 for Revisiting Vision Transformer from the View of Path Ensemble
Figure 3 for Revisiting Vision Transformer from the View of Path Ensemble
Figure 4 for Revisiting Vision Transformer from the View of Path Ensemble
Viaarxiv icon

Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment

Add code
Bookmark button
Alert button
Jul 24, 2023
Sarah Ibrahimi, Xiaohang Sun, Pichao Wang, Amanmeet Garg, Ashutosh Sanan, Mohamed Omar

Figure 1 for Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment
Figure 2 for Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment
Figure 3 for Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment
Figure 4 for Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment
Viaarxiv icon

DOAD: Decoupled One Stage Action Detection Network

Add code
Bookmark button
Alert button
Apr 04, 2023
Shuning Chang, Pichao Wang, Fan Wang, Jiashi Feng, Mike Zheng Show

Figure 1 for DOAD: Decoupled One Stage Action Detection Network
Figure 2 for DOAD: Decoupled One Stage Action Detection Network
Figure 3 for DOAD: Decoupled One Stage Action Detection Network
Figure 4 for DOAD: Decoupled One Stage Action Detection Network
Viaarxiv icon