Xingchen Song

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

Oct 07, 2023
Kaixun Huang, Ao Zhang, Binbin Zhang, Tianyi Xu, Xingchen Song, Lei Xie

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech

Aug 31, 2023
Jie Chen, Xingchen Song, Zhendong Peng, Binbin Zhang, Fuping Pan, Zhiyong Wu

ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs

May 18, 2023
Xingchen Song, Di Wu, Binbin Zhang, Zhendong Peng, Bo Dang, Fuping Pan, Zhiyong Wu

CB-Conformer: Contextual biasing Conformer for biased word recognition

Apr 25, 2023
Yaoxun Xu, Baiji Liu, Qiaochu Huang, Xingchen Song, Zhiyong Wu, Shiyin Kang, Helen Meng

Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames

Nov 02, 2022
Chengdong Liang, Xiao-Lei Zhang, Binbin Zhang, Di Wu, Shengqiang Li, Xingchen Song, Zhendong Peng, Fuping Pan

TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty

Nov 01, 2022
Xingchen Song, Di Wu, Zhiyong Wu, Binbin Zhang, Yuekai Zhang, Zhendong Peng, Wenpeng Li, Fuping Pan, Changbao Zhu

FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition

Oct 31, 2022
Xingchen Song, Di Wu, Binbin Zhang, Zhiyong Wu, Wenpeng Li, Dongfang Li, Pengshen Zhang, Zhendong Peng, Fuping Pan, Changbao Zhu, Zhongqin Wu

WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit

Mar 29, 2022
Binbin Zhang, Di Wu, Zhendong Peng, Xingchen Song, Zhuoyuan Yao, Hang Lv, Lei Xie, Chao Yang, Fuping Pan, Jianwei Niu

Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input

Oct 28, 2020
Xingchen Song, Zhiyong Wu, Yiheng Huang, Chao Weng, Dan Su, Helen Meng
