Yasi Peng

Spatial-temporal Transformer for Affective Behavior Analysis

Mar 19, 2023
Peng Zou, Rui Wang, Kehua Wen, Yasi Peng, Xiao Sun

Affective behavior analysis in the wild is an important research topic. In this paper, we present our solutions for the 5th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW), which includes the Valence-Arousal (V-A) Estimation, Facial Expression Classification, and Action Unit (AU) Detection sub-challenges. We propose a framework based on a Transformer encoder with multi-head attention to learn the distribution of both spatial and temporal features. In addition, we employ various data augmentation strategies to alleviate sample imbalance during model training. Results on the Aff-Wild2 dataset demonstrate the effectiveness of the proposed model.
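
As a rough illustration of the attention operation at the core of a Transformer encoder, the sketch below applies single-head scaled dot-product self-attention to a short sequence of per-frame feature vectors. It is not the authors' model: the function names and toy features are hypothetical, and learned query/key/value projections, multiple heads, positional encodings, and feed-forward layers are all omitted for brevity.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(frames):
    """Single-head scaled dot-product self-attention.

    Assumption: identity projections, so queries, keys, and values are
    the input frame features themselves.
    """
    d = len(frames[0])
    attended = []
    for query in frames:
        # Similarity of this frame to every frame in the sequence.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
            for key in frames
        ]
        weights = softmax(scores)
        # Each output is a weighted average of all frame features.
        attended.append(
            [sum(w * value[i] for w, value in zip(weights, frames)) for i in range(d)]
        )
    return attended


# Toy sequence of three frames with two features each (made-up values).
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(frames)
```

Each output vector mixes information from every frame in the sequence, which is what lets an encoder of this kind model temporal context.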

Hybrid Multimodal Fusion for Humor Detection

Sep 24, 2022
Haojie Xu, Weifeng Liu, Jingwei Liu, Mingzheng Li, Yu Feng, Yasi Peng, Yunwei Shi, Xiao Sun, Meng Wang

In this paper, we present our solution to the MuSe-Humor sub-challenge of the Multimodal Emotional Challenge (MuSe) 2022. The goal of the MuSe-Humor sub-challenge is to detect humor in audiovisual recordings of German Bundesliga football press conferences, which are annotated for humor displayed by the coaches; performance is measured by AUC. For this sub-challenge, we first build a discriminative model using a Transformer module and a BiLSTM module, and then propose a hybrid fusion strategy that uses the prediction results of each modality to improve model performance. Our experiments demonstrate the effectiveness of the proposed model and hybrid fusion strategy for multimodal fusion: the model achieves an AUC of 0.8972 on the test set.
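
To make the idea of combining per-modality predictions and scoring them with AUC concrete, here is a minimal sketch. It uses a plain weighted average as a stand-in for late fusion; this is an assumption for illustration and not the hybrid fusion strategy proposed in the paper. The function names, modality names, weights, and scores are all hypothetical.

```python
def fuse_scores(per_modality_scores, weights):
    """Weighted average of per-modality humor scores (simple late fusion)."""
    total = sum(weights[m] for m in per_modality_scores)
    n = len(next(iter(per_modality_scores.values())))
    return [
        sum(weights[m] * scores[i] for m, scores in per_modality_scores.items()) / total
        for i in range(n)
    ]


def roc_auc(labels, scores):
    """AUC as the probability that a positive outscores a negative.

    Ties count as half a win (the Mann-Whitney formulation).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Made-up scores for four segments from two modalities.
labels = [0, 0, 1, 1]
per_modality = {
    "audio": [0.2, 0.5, 0.4, 0.9],
    "video": [0.1, 0.3, 0.6, 0.7],
}
fused = fuse_scores(per_modality, {"audio": 1.0, "video": 1.0})
auc = roc_auc(labels, fused)
```

In this toy example the audio scores alone misrank one positive/negative pair, while the fused scores rank every positive above every negative, which is the kind of gain fusion aims for.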

* 7 pages, 1 figure, to appear in MuSe 2022 (ACM MM2022 co-located workshop) 