
Linhao Dong


SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR

Mar 04, 2024
Zhiyun Fan, Linhao Dong, Jun Zhang, Lu Lu, Zejun Ma

Via arXiv

CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training

May 27, 2023
Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma

Via arXiv

Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire

Nov 17, 2022
Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang, Zejun Ma, Bo Xu

Via arXiv

Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire

Jun 27, 2022
Zhiyun Fan, Linhao Dong, Meng Cai, Zejun Ma, Bo Xu

Via arXiv

Improving End-to-End Contextual Speech Recognition with Fine-grained Contextual Knowledge Selection

Jan 30, 2022
Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu

Via arXiv

CIF-based Collaborative Decoding for End-to-End Contextual Speech Recognition

Dec 17, 2020
Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu

Via arXiv

A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition

May 25, 2020
Linhao Dong, Cheng Yi, Jianzong Wang, Shiyu Zhou, Shuang Xu, Xueli Jia, Bo Xu

Via arXiv

CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition

May 27, 2019
Linhao Dong, Bo Xu

Via arXiv

Self-Attention Aligner: A Latency-Control End-to-End Model for ASR Using Self-Attention Network and Chunk-Hopping

Feb 18, 2019
Linhao Dong, Feng Wang, Bo Xu

Via arXiv