Pengyuan Zhang

Improving CTC-based speech recognition via knowledge transferring from pre-trained language models

Feb 22, 2022
Keqi Deng, Songjun Cao, Yike Zhang, Long Ma, Gaofeng Cheng, Ji Xu, Pengyuan Zhang

The HCCL-DKU system for fake audio generation task of the 2022 ICASSP ADD Challenge

Jan 29, 2022
Ziyi Chen, Hua Hua, Yuxiang Zhang, Ming Li, Pengyuan Zhang

Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models

Jan 26, 2022
Keqi Deng, Zehui Yang, Shinji Watanabe, Yosuke Higuchi, Gaofeng Cheng, Pengyuan Zhang

Data Augmentation based Consistency Contrastive Pre-training for Automatic Speech Recognition

Dec 23, 2021
Changfeng Gao, Gaofeng Cheng, Yifan Guo, Qingwei Zhao, Pengyuan Zhang

Wav2vec-S: Semi-Supervised Pre-Training for Speech Recognition

Oct 09, 2021
Han Zhu, Li Wang, Ying Hou, Jindong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement

Apr 27, 2021
Feng Dang, Hangting Chen, Pengyuan Zhang

Improved Conformer-based End-to-End Speech Recognition Using Neural Architecture Search

Apr 13, 2021
Yukun Liu, Ta Li, Pengyuan Zhang, Yonghong Yan

Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output

Feb 20, 2021
Hangting Chen, Pengyuan Zhang

Domain Adaptation Using Class Similarity for Robust Speech Recognition

Nov 05, 2020
Han Zhu, Jiangjiang Zhao, Yuling Ren, Li Wang, Pengyuan Zhang

Multi-Accent Adaptation based on Gate Mechanism

Nov 05, 2020
Han Zhu, Li Wang, Pengyuan Zhang, Yonghong Yan
