Alert button
Picture for Xianrui Zheng

Xianrui Zheng

Alert button

Conditional Diffusion Model for Target Speaker Extraction

Add code
Bookmark button
Alert button
Oct 07, 2023
Theodor Nguyen, Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C Woodland

Figure 1 for Conditional Diffusion Model for Target Speaker Extraction
Figure 2 for Conditional Diffusion Model for Target Speaker Extraction
Figure 3 for Conditional Diffusion Model for Target Speaker Extraction
Figure 4 for Conditional Diffusion Model for Target Speaker Extraction
Viaarxiv icon

Can Contextual Biasing Remain Effective with Whisper and GPT-2?

Add code
Bookmark button
Alert button
Jun 02, 2023
Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland

Figure 1 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Figure 2 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Figure 3 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Figure 4 for Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Viaarxiv icon

Self-Supervised Learning-Based Source Separation for Meeting Data

Add code
Bookmark button
Alert button
Apr 03, 2023
Yuang Li, Xianrui Zheng, Philip C. Woodland

Figure 1 for Self-Supervised Learning-Based Source Separation for Meeting Data
Figure 2 for Self-Supervised Learning-Based Source Separation for Meeting Data
Figure 3 for Self-Supervised Learning-Based Source Separation for Meeting Data
Figure 4 for Self-Supervised Learning-Based Source Separation for Meeting Data
Viaarxiv icon

Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription

Add code
Bookmark button
Alert button
Jul 08, 2022
Xianrui Zheng, Chao Zhang, Philip C. Woodland

Figure 1 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Figure 2 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Figure 3 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Figure 4 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Viaarxiv icon

Multi-turn RNN-T for streaming recognition of multi-party speech

Add code
Bookmark button
Alert button
Dec 19, 2021
Ilya Sklyar, Anna Piunova, Xianrui Zheng, Yulan Liu

Figure 1 for Multi-turn RNN-T for streaming recognition of multi-party speech
Figure 2 for Multi-turn RNN-T for streaming recognition of multi-party speech
Figure 3 for Multi-turn RNN-T for streaming recognition of multi-party speech
Figure 4 for Multi-turn RNN-T for streaming recognition of multi-party speech
Viaarxiv icon

Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition

Add code
Bookmark button
Alert button
Jul 29, 2021
Xianrui Zheng, Chao Zhang, Philip C. Woodland

Figure 1 for Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Figure 2 for Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Figure 3 for Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Figure 4 for Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Viaarxiv icon