Alert button
Picture for Jun Du

Jun Du

Alert button

USTC-NELSLIP System Description for DIHARD-III Challenge

Mar 19, 2021
Yuxuan Wang, Maokui He, Shutong Niu, Lei Sun, Tian Gao, Xin Fang, Jia Pan, Jun Du, Chin-Hui Lee

Figure 1 for USTC-NELSLIP System Description for DIHARD-III Challenge
Figure 2 for USTC-NELSLIP System Description for DIHARD-III Challenge
Figure 3 for USTC-NELSLIP System Description for DIHARD-III Challenge
Figure 4 for USTC-NELSLIP System Description for DIHARD-III Challenge
Viaarxiv icon

A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection

Jan 08, 2021
Qing Wang, Jun Du, Hua-Xin Wu, Jia Pan, Feng Ma, Chin-Hui Lee

Figure 1 for A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection
Figure 2 for A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection
Figure 3 for A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection
Figure 4 for A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection
Viaarxiv icon

Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention

Dec 28, 2020
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Chin-Hui Lee, Bao-Cai Yin

Figure 1 for Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention
Figure 2 for Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention
Figure 3 for Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention
Figure 4 for Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention
Viaarxiv icon

Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition

Dec 27, 2020
Hengshun Zhou, Debin Meng, Yuanyuan Zhang, Xiaojiang Peng, Jun Du, Kai Wang, Yu Qiao

Figure 1 for Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition
Figure 2 for Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition
Figure 3 for Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition
Figure 4 for Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition
Viaarxiv icon

Frequency Gating: Improved Convolutional Neural Networks for Speech Enhancement in the Time-Frequency Domain

Nov 08, 2020
Koen Oostermeijer, Qing Wang, Jun Du

Figure 1 for Frequency Gating: Improved Convolutional Neural Networks for Speech Enhancement in the Time-Frequency Domain
Figure 2 for Frequency Gating: Improved Convolutional Neural Networks for Speech Enhancement in the Time-Frequency Domain
Figure 3 for Frequency Gating: Improved Convolutional Neural Networks for Speech Enhancement in the Time-Frequency Domain
Figure 4 for Frequency Gating: Improved Convolutional Neural Networks for Speech Enhancement in the Time-Frequency Domain
Viaarxiv icon

A Two-Stage Approach to Device-Robust Acoustic Scene Classification

Nov 03, 2020
Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee

Figure 1 for A Two-Stage Approach to Device-Robust Acoustic Scene Classification
Figure 2 for A Two-Stage Approach to Device-Robust Acoustic Scene Classification
Figure 3 for A Two-Stage Approach to Device-Robust Acoustic Scene Classification
Figure 4 for A Two-Stage Approach to Device-Robust Acoustic Scene Classification
Viaarxiv icon

Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement

Sep 21, 2020
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee

Figure 1 for Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement
Figure 2 for Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement
Figure 3 for Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement
Figure 4 for Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement
Viaarxiv icon

Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation

Aug 27, 2020
Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee

Figure 1 for Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation
Figure 2 for Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation
Figure 3 for Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation
Viaarxiv icon

On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression

Aug 12, 2020
Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

Figure 1 for On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression
Figure 2 for On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression
Viaarxiv icon

Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression

Aug 04, 2020
Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

Figure 1 for Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression
Figure 2 for Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression
Figure 3 for Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression
Figure 4 for Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression
Viaarxiv icon