Picture for Gangshan Wu

Gangshan Wu

GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation

Add code
Jul 16, 2024
Viaarxiv icon

AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation

Add code
Jul 05, 2024
Viaarxiv icon

Open-Vocabulary Spatio-Temporal Action Detection

Add code
May 17, 2024
Figure 1 for Open-Vocabulary Spatio-Temporal Action Detection
Figure 2 for Open-Vocabulary Spatio-Temporal Action Detection
Figure 3 for Open-Vocabulary Spatio-Temporal Action Detection
Figure 4 for Open-Vocabulary Spatio-Temporal Action Detection
Viaarxiv icon

SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos

Add code
Apr 06, 2024
Viaarxiv icon

Dual DETRs for Multi-Label Temporal Action Detection

Add code
Mar 31, 2024
Viaarxiv icon

Spatiotemporal Predictive Pre-training for Robotic Motor Control

Add code
Mar 14, 2024
Figure 1 for Spatiotemporal Predictive Pre-training for Robotic Motor Control
Figure 2 for Spatiotemporal Predictive Pre-training for Robotic Motor Control
Figure 3 for Spatiotemporal Predictive Pre-training for Robotic Motor Control
Figure 4 for Spatiotemporal Predictive Pre-training for Robotic Motor Control
Viaarxiv icon

Sketch and Refine: Towards Fast and Accurate Lane Detection

Add code
Jan 26, 2024
Viaarxiv icon

Asymmetric Masked Distillation for Pre-Training Small Foundation Models

Add code
Nov 06, 2023
Viaarxiv icon

Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation

Add code
Aug 25, 2023
Figure 1 for Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation
Figure 2 for Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation
Figure 3 for Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation
Figure 4 for Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation
Viaarxiv icon

DPL: Decoupled Prompt Learning for Vision-Language Models

Add code
Aug 19, 2023
Viaarxiv icon