Alert button
Picture for Binbin Zhang

Binbin Zhang

Alert button

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

Jan 07, 2024
He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li

Viaarxiv icon

The GUA-Speech System Description for CNVSRC Challenge 2023

Dec 12, 2023
Shengqiang Li, Chao Lei, Baozhong Ma, Binbin Zhang, Fuping Pan

Viaarxiv icon

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

Oct 07, 2023
Kaixun Huang, Ao Zhang, Binbin Zhang, Tianyi Xu, Xingchen Song, Lei Xie

Viaarxiv icon

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech

Aug 31, 2023
Jie Chen, Xingchen Song, Zhendong Peng, Binbin Zhang, Fuping Pan, Zhiyong Wu

Figure 1 for LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech
Figure 2 for LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech
Figure 3 for LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech
Viaarxiv icon

ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs

May 18, 2023
Xingchen Song, Di Wu, Binbin Zhang, Zhendong Peng, Bo Dang, Fuping Pan, Zhiyong Wu

Figure 1 for ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs
Figure 2 for ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs
Figure 3 for ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs
Figure 4 for ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs
Viaarxiv icon

The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results

Nov 03, 2022
Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu

Figure 1 for The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
Figure 2 for The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
Figure 3 for The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
Figure 4 for The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
Viaarxiv icon

TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty

Nov 01, 2022
Xingchen Song, Di Wu, Zhiyong Wu, Binbin Zhang, Yuekai Zhang, Zhendong Peng, Wenpeng Li, Fuping Pan, Changbao Zhu

Figure 1 for TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty
Figure 2 for TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty
Figure 3 for TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty
Figure 4 for TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty
Viaarxiv icon

Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit

Nov 01, 2022
Hongji Wang, Chengdong Liang, Shuai Wang, Zhengyang Chen, Binbin Zhang, Xu Xiang, Yanlei Deng, Yanmin Qian

Figure 1 for Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit
Figure 2 for Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit
Figure 3 for Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit
Figure 4 for Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit
Viaarxiv icon

FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition

Oct 31, 2022
Xingchen Song, Di Wu, Binbin Zhang, Zhiyong Wu, Wenpeng Li, Dongfang Li, Pengshen Zhang, Zhendong Peng, Fuping Pan, Changbao Zhu, Zhongqin Wu

Figure 1 for FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition
Figure 2 for FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition
Figure 3 for FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition
Figure 4 for FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition
Viaarxiv icon

WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit

Oct 30, 2022
Jie Wang, Menglong Xu, Jingyong Hou, Binbin Zhang, Xiao-Lei Zhang, Lei Xie, Fuping Pan

Figure 1 for WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit
Figure 2 for WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit
Figure 3 for WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit
Figure 4 for WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit
Viaarxiv icon