Yonghong Yan

Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition

Aug 12, 2023
Han Zhu, Dongji Gao, Gaofeng Cheng, Daniel Povey, Pengyuan Zhang, Yonghong Yan

Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture

Jul 05, 2023
Haoran Miao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement

May 15, 2023
Feng Dang, Qi Hu, Pengyuan Zhang, Yonghong Yan

Speech Corpora Divergence Based Unsupervised Data Selection for ASR

Feb 26, 2023
Changfeng Gao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge

Oct 13, 2022
Shuhao Deng, Chengfei Li, Jinfeng Bai, Qingqing Zhang, Wei-Qiang Zhang, Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Aug 17, 2022
Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan

Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies

Jul 06, 2022
Zehan Li, Haoran Miao, Keqi Deng, Gaofeng Cheng, Sanli Tian, Ta Li, Yonghong Yan

Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization

Jun 28, 2022
Yifan Chen, Yifan Guo, Qingxuan Li, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

Boosting Cross-Domain Speech Recognition with Self-Supervision

Jun 20, 2022
Han Zhu, Gaofeng Cheng, Jindong Wang, Wenxin Hou, Pengyuan Zhang, Yonghong Yan
