Picture for Gaofeng Cheng

Gaofeng Cheng

Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing

Add code
Jan 01, 2025
Viaarxiv icon

SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation

Add code
Jan 01, 2025
Figure 1 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 2 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 3 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 4 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Viaarxiv icon

Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition

Add code
Dec 15, 2024
Viaarxiv icon

Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition

Add code
Aug 12, 2023
Figure 1 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 2 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 3 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 4 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Viaarxiv icon

Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture

Add code
Jul 05, 2023
Figure 1 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Figure 2 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Figure 3 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Figure 4 for Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture
Viaarxiv icon

Speech Corpora Divergence Based Unsupervised Data Selection for ASR

Add code
Feb 26, 2023
Viaarxiv icon

Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge

Add code
Oct 13, 2022
Figure 1 for Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge
Viaarxiv icon

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Add code
Aug 17, 2022
Figure 1 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Figure 2 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Figure 3 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Figure 4 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Viaarxiv icon

Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies

Add code
Jul 06, 2022
Figure 1 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Figure 2 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Figure 3 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Viaarxiv icon

Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization

Add code
Jun 28, 2022
Figure 1 for Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
Figure 2 for Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
Figure 3 for Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
Viaarxiv icon