Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization



Yifan Chen , Yifan Guo , Qingxuan Li , Gaofeng Cheng , Pengyuan Zhang , Yonghong Yan

* Accepted by Interspeech 2022 

   Access Paper or Ask Questions

Boosting Cross-Domain Speech Recognition with Self-Supervision



Han Zhu , Gaofeng Cheng , Jindong Wang , Wenxin Hou , Pengyuan Zhang , Yonghong Yan


   Access Paper or Ask Questions

Decoupled Federated Learning for ASR with Non-IID Data



Han Zhu , Jindong Wang , Gaofeng Cheng , Pengyuan Zhang , Yonghong Yan

* Accepted by Interspeech 2022 

   Access Paper or Ask Questions

Streaming non-autoregressive model for any-to-many voice conversion



Ziyi Chen , Haoran Miao , Pengyuan Zhang


   Access Paper or Ask Questions

Audio-Visual Scene Classification Using A Transfer Learning Based Joint Optimization Strategy



Chengxin Chen , Meng Wang , Pengyuan Zhang

* 5 pages, 2 figures, based on the work that won first place in the challenge of DCASE2021 Task 1B 

   Access Paper or Ask Questions

Back-ends Selection for Deep Speaker Embeddings



Zhuo Li , Runqiu Xiao , Zihan Zhang , Zhenduo Zhao , Wenchao Wang , Pengyuan Zhang

* submitted to interspeech2022 

   Access Paper or Ask Questions

CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition



Chengxin Chen , Pengyuan Zhang

* 5 pages, 2 figures, submitted to INTERSPEECH 2022 

   Access Paper or Ask Questions

Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset



Zehui Yang , Yifan Chen , Lei Luo , Runyan Yang , Lingxuan Ye , Gaofeng Cheng , Ji Xu , Yaohui Jin , Qingqing Zhang , Pengyuan Zhang , Lei Xie , Yonghong Yan

* Paper on submission to Interspeech2022 

   Access Paper or Ask Questions

Improving CTC-based speech recognition via knowledge transferring from pre-trained language models



Keqi Deng , Songjun Cao , Yike Zhang , Long Ma , Gaofeng Cheng , Ji Xu , Pengyuan Zhang

* ICASSP 2022 

   Access Paper or Ask Questions

The HCCL-DKU system for fake audio generation task of the 2022 ICASSP ADD Challenge



Ziyi Chen , Hua Hua , Yuxiang Zhang , Ming Li , Pengyuan Zhang


   Access Paper or Ask Questions

1
2
3
>>