Alert button
Picture for Zhiyun Fan

Zhiyun Fan

Alert button

SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR

Add code
Bookmark button
Alert button
Mar 04, 2024
Zhiyun Fan, Linhao Dong, Jun Zhang, Lu Lu, Zejun Ma

Figure 1 for SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR
Figure 2 for SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR
Figure 3 for SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR
Figure 4 for SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR
Viaarxiv icon

Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire

Add code
Bookmark button
Alert button
Nov 17, 2022
Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang, Zejun Ma, Bo Xu

Figure 1 for Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire
Figure 2 for Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire
Figure 3 for Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire
Figure 4 for Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire
Viaarxiv icon

Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire

Add code
Bookmark button
Alert button
Jun 27, 2022
Zhiyun Fan, Linhao Dong, Meng Cai, Zejun Ma, Bo Xu

Figure 1 for Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire
Figure 2 for Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire
Figure 3 for Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire
Figure 4 for Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire
Viaarxiv icon

Exploring wav2vec 2.0 on speaker verification and language identification

Add code
Bookmark button
Alert button
Jan 14, 2021
Zhiyun Fan, Meng Li, Shiyu Zhou, Bo Xu

Figure 1 for Exploring wav2vec 2.0 on speaker verification and language identification
Figure 2 for Exploring wav2vec 2.0 on speaker verification and language identification
Figure 3 for Exploring wav2vec 2.0 on speaker verification and language identification
Figure 4 for Exploring wav2vec 2.0 on speaker verification and language identification
Viaarxiv icon

Speaker-aware speech-transformer

Add code
Bookmark button
Alert button
Jan 02, 2020
Zhiyun Fan, Jie Li, Shiyu Zhou, Bo Xu

Figure 1 for Speaker-aware speech-transformer
Figure 2 for Speaker-aware speech-transformer
Figure 3 for Speaker-aware speech-transformer
Figure 4 for Speaker-aware speech-transformer
Viaarxiv icon

Unsupervised pre-traing for sequence to sequence speech recognition

Add code
Bookmark button
Alert button
Oct 28, 2019
Zhiyun Fan, Shiyu Zhou, Bo Xu

Figure 1 for Unsupervised pre-traing for sequence to sequence speech recognition
Figure 2 for Unsupervised pre-traing for sequence to sequence speech recognition
Figure 3 for Unsupervised pre-traing for sequence to sequence speech recognition
Figure 4 for Unsupervised pre-traing for sequence to sequence speech recognition
Viaarxiv icon