Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Weihao Gan

STM: SpatioTemporal and Motion Encoding for Action Recognition

Aug 16, 2019

Boyuan Jiang, Mengmeng Wang, Weihao Gan, Wei Wu, Junjie Yan

Figure 1 for STM: SpatioTemporal and Motion Encoding for Action Recognition

Figure 2 for STM: SpatioTemporal and Motion Encoding for Action Recognition

Figure 3 for STM: SpatioTemporal and Motion Encoding for Action Recognition

Figure 4 for STM: SpatioTemporal and Motion Encoding for Action Recognition

Abstract:Spatiotemporal and motion features are two complementary and crucial information for video action recognition. Recent state-of-the-art methods adopt a 3D CNN stream to learn spatiotemporal features and another flow stream to learn motion features. In this work, we aim to efficiently encode these two features in a unified 2D framework. To this end, we first propose an STM block, which contains a Channel-wise SpatioTemporal Module (CSTM) to present the spatiotemporal features and a Channel-wise Motion Module (CMM) to efficiently encode motion features. We then replace original residual blocks in the ResNet architecture with STM blcoks to form a simple yet effective STM network by introducing very limited extra computation cost. Extensive experiments demonstrate that the proposed STM network outperforms the state-of-the-art methods on both temporal-related datasets (i.e., Something-Something v1 & v2 and Jester) and scene-related datasets (i.e., Kinetics-400, UCF-101, and HMDB-51) with the help of encoding spatiotemporal and motion features together.

* Accepted by ICCV2019

Via

Access Paper or Ask Questions

Dynamic Curriculum Learning for Imbalanced Data Classification

Jan 21, 2019

Yiru Wang, Weihao Gan, Wei Wu, Junjie Yan

Figure 1 for Dynamic Curriculum Learning for Imbalanced Data Classification

Figure 2 for Dynamic Curriculum Learning for Imbalanced Data Classification

Figure 3 for Dynamic Curriculum Learning for Imbalanced Data Classification

Figure 4 for Dynamic Curriculum Learning for Imbalanced Data Classification

Abstract:Human attribute analysis is a challenging task in the field of computer vision, since the data is largely imbalance-distributed. Common techniques such as re-sampling and cost-sensitive learning require prior-knowledge to train the system. To address this problem, we propose a unified framework called Dynamic Curriculum Learning (DCL) to online adaptively adjust the sampling strategy and loss learning in single batch, which resulting in better generalization and discrimination. Inspired by the curriculum learning, DCL consists of two level curriculum schedulers: (1) sampling scheduler not only manages the data distribution from imbalanced to balanced but also from easy to hard; (2) loss scheduler controls the learning importance between classification and metric learning loss. Learning from these two schedulers, we demonstrate our DCL framework with the new state-of-the-art performance on the widely used face attribute dataset CelebA and pedestrian attribute dataset RAP.

Via

Access Paper or Ask Questions