Picture for Xianrui Zheng

Xianrui Zheng

SOT Triggered Neural Clustering for Speaker Attributed ASR

Add code
Jul 02, 2024
Figure 1 for SOT Triggered Neural Clustering for Speaker Attributed ASR
Figure 2 for SOT Triggered Neural Clustering for Speaker Attributed ASR
Figure 3 for SOT Triggered Neural Clustering for Speaker Attributed ASR
Figure 4 for SOT Triggered Neural Clustering for Speaker Attributed ASR
Viaarxiv icon

Conditional Diffusion Model for Target Speaker Extraction

Add code
Oct 07, 2023
Figure 1 for Conditional Diffusion Model for Target Speaker Extraction
Figure 2 for Conditional Diffusion Model for Target Speaker Extraction
Figure 3 for Conditional Diffusion Model for Target Speaker Extraction
Figure 4 for Conditional Diffusion Model for Target Speaker Extraction
Viaarxiv icon

Can Contextual Biasing Remain Effective with Whisper and GPT-2?

Add code
Jun 02, 2023
Figure 1 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Figure 2 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Figure 3 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Figure 4 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Viaarxiv icon

Self-Supervised Learning-Based Source Separation for Meeting Data

Add code
Apr 03, 2023
Figure 1 for Self-Supervised Learning-Based Source Separation for Meeting Data
Figure 2 for Self-Supervised Learning-Based Source Separation for Meeting Data
Figure 3 for Self-Supervised Learning-Based Source Separation for Meeting Data
Figure 4 for Self-Supervised Learning-Based Source Separation for Meeting Data
Viaarxiv icon

Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription

Add code
Jul 08, 2022
Figure 1 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Figure 2 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Figure 3 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Figure 4 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Viaarxiv icon

Multi-turn RNN-T for streaming recognition of multi-party speech

Add code
Dec 19, 2021
Figure 1 for Multi-turn RNN-T for streaming recognition of multi-party speech
Figure 2 for Multi-turn RNN-T for streaming recognition of multi-party speech
Figure 3 for Multi-turn RNN-T for streaming recognition of multi-party speech
Figure 4 for Multi-turn RNN-T for streaming recognition of multi-party speech
Viaarxiv icon

Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition

Add code
Jul 29, 2021
Figure 1 for Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Figure 2 for Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Figure 3 for Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Figure 4 for Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Viaarxiv icon