Pengyuan Zhang

Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge

Oct 12, 2022
Shuhao Deng, Chengfei Li, Jinfeng Bai, Qingqing Zhang, Wei-Qiang Zhang, Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Aug 17, 2022
Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan

SASV Based on Pre-trained ASV System and Integrated Scoring Module

Jul 01, 2022
Yuxiang Zhang, Zhuo Li, Wenchao Wang, Pengyuan Zhang

Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization

Jun 28, 2022
Yifan Chen, Yifan Guo, Qingxuan Li, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

Boosting Cross-Domain Speech Recognition with Self-Supervision

Jun 20, 2022
Han Zhu, Gaofeng Cheng, Jindong Wang, Wenxin Hou, Pengyuan Zhang, Yonghong Yan

Decoupled Federated Learning for ASR with Non-IID Data

Jun 18, 2022
Han Zhu, Jindong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

Streaming non-autoregressive model for any-to-many voice conversion

Jun 15, 2022
Ziyi Chen, Haoran Miao, Pengyuan Zhang

Audio-Visual Scene Classification Using A Transfer Learning Based Joint Optimization Strategy

Apr 25, 2022
Chengxin Chen, Meng Wang, Pengyuan Zhang

Back-ends Selection for Deep Speaker Embeddings

Apr 25, 2022
Zhuo Li, Runqiu Xiao, Zihan Zhang, Zhenduo Zhao, Wenchao Wang, Pengyuan Zhang

CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition

Mar 31, 2022
Chengxin Chen, Pengyuan Zhang
