Picture for Yonghong Yan

Yonghong Yan

Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition

Add code
Aug 12, 2023
Figure 1 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 2 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 3 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 4 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Viaarxiv icon

Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture

Add code
Jul 05, 2023
Figure 1 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Figure 2 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Figure 3 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Figure 4 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Viaarxiv icon

ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement

Add code
May 15, 2023
Figure 1 for ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement
Figure 2 for ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement
Figure 3 for ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement
Figure 4 for ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement
Viaarxiv icon

Speech Corpora Divergence Based Unsupervised Data Selection for ASR

Add code
Feb 26, 2023
Figure 1 for Speech Corpora Divergence Based Unsupervised Data Selection for ASR
Figure 2 for Speech Corpora Divergence Based Unsupervised Data Selection for ASR
Figure 3 for Speech Corpora Divergence Based Unsupervised Data Selection for ASR
Figure 4 for Speech Corpora Divergence Based Unsupervised Data Selection for ASR
Viaarxiv icon

Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge

Add code
Oct 13, 2022
Figure 1 for Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge
Viaarxiv icon

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Add code
Aug 17, 2022
Figure 1 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Figure 2 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Figure 3 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Figure 4 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Viaarxiv icon

Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies

Add code
Jul 06, 2022
Figure 1 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Figure 2 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Figure 3 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Viaarxiv icon

Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization

Add code
Jun 28, 2022
Figure 1 for Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
Figure 2 for Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
Figure 3 for Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
Viaarxiv icon

Boosting Cross-Domain Speech Recognition with Self-Supervision

Add code
Jun 20, 2022
Figure 1 for Boosting Cross-Domain Speech Recognition with Self-Supervision
Figure 2 for Boosting Cross-Domain Speech Recognition with Self-Supervision
Figure 3 for Boosting Cross-Domain Speech Recognition with Self-Supervision
Figure 4 for Boosting Cross-Domain Speech Recognition with Self-Supervision
Viaarxiv icon

Decoupled Federated Learning for ASR with Non-IID Data

Add code
Jun 18, 2022
Figure 1 for Decoupled Federated Learning for ASR with Non-IID Data
Figure 2 for Decoupled Federated Learning for ASR with Non-IID Data
Figure 3 for Decoupled Federated Learning for ASR with Non-IID Data
Figure 4 for Decoupled Federated Learning for ASR with Non-IID Data
Viaarxiv icon