Picture for Ziyuan Huang

Ziyuan Huang

Support-Set Based Cross-Supervision for Video Grounding

Add code
Aug 24, 2021
Figure 1 for Support-Set Based Cross-Supervision for Video Grounding
Figure 2 for Support-Set Based Cross-Supervision for Video Grounding
Figure 3 for Support-Set Based Cross-Supervision for Video Grounding
Figure 4 for Support-Set Based Cross-Supervision for Video Grounding
Viaarxiv icon

Exploring Stronger Feature for Temporal Action Localization

Add code
Jun 24, 2021
Figure 1 for Exploring Stronger Feature for Temporal Action Localization
Figure 2 for Exploring Stronger Feature for Temporal Action Localization
Figure 3 for Exploring Stronger Feature for Temporal Action Localization
Figure 4 for Exploring Stronger Feature for Temporal Action Localization
Viaarxiv icon

Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling

Add code
Jun 20, 2021
Figure 1 for Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling
Figure 2 for Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling
Viaarxiv icon

Proposal Relation Network for Temporal Action Detection

Add code
Jun 20, 2021
Figure 1 for Proposal Relation Network for Temporal Action Detection
Figure 2 for Proposal Relation Network for Temporal Action Detection
Figure 3 for Proposal Relation Network for Temporal Action Detection
Figure 4 for Proposal Relation Network for Temporal Action Detection
Viaarxiv icon

Relation Modeling in Spatio-Temporal Action Localization

Add code
Jun 16, 2021
Figure 1 for Relation Modeling in Spatio-Temporal Action Localization
Figure 2 for Relation Modeling in Spatio-Temporal Action Localization
Figure 3 for Relation Modeling in Spatio-Temporal Action Localization
Figure 4 for Relation Modeling in Spatio-Temporal Action Localization
Viaarxiv icon

A Stronger Baseline for Ego-Centric Action Detection

Add code
Jun 13, 2021
Figure 1 for A Stronger Baseline for Ego-Centric Action Detection
Figure 2 for A Stronger Baseline for Ego-Centric Action Detection
Viaarxiv icon

Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition

Add code
Jun 09, 2021
Figure 1 for Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition
Figure 2 for Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition
Figure 3 for Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition
Figure 4 for Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition
Viaarxiv icon

Multi-Scale Feature Aggregation by Cross-Scale Pixel-to-Region Relation Operation for Semantic Segmentation

Add code
Jun 03, 2021
Figure 1 for Multi-Scale Feature Aggregation by Cross-Scale Pixel-to-Region Relation Operation for Semantic Segmentation
Figure 2 for Multi-Scale Feature Aggregation by Cross-Scale Pixel-to-Region Relation Operation for Semantic Segmentation
Figure 3 for Multi-Scale Feature Aggregation by Cross-Scale Pixel-to-Region Relation Operation for Semantic Segmentation
Figure 4 for Multi-Scale Feature Aggregation by Cross-Scale Pixel-to-Region Relation Operation for Semantic Segmentation
Viaarxiv icon

Self-supervised Motion Learning from Static Images

Add code
Apr 01, 2021
Figure 1 for Self-supervised Motion Learning from Static Images
Figure 2 for Self-supervised Motion Learning from Static Images
Figure 3 for Self-supervised Motion Learning from Static Images
Figure 4 for Self-supervised Motion Learning from Static Images
Viaarxiv icon

Towards Accurate Human Pose Estimation in Videos of Crowded Scenes

Add code
Oct 21, 2020
Figure 1 for Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
Figure 2 for Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
Figure 3 for Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
Figure 4 for Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
Viaarxiv icon