Alert button
Picture for Songjun Cao

Songjun Cao

Alert button

DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model

Add code
Bookmark button
Alert button
Mar 16, 2023
Yanzhe Fu, Yueteng Kang, Songjun Cao, Long Ma

Figure 1 for DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model
Figure 2 for DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model
Figure 3 for DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model
Figure 4 for DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model
Viaarxiv icon

Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training

Add code
Bookmark button
Alert button
Jun 27, 2022
Bowen Zhang, Songjun Cao, Xiaoming Zhang, Yike Zhang, Long Ma, Takahiro Shinozaki

Figure 1 for Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
Figure 2 for Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
Figure 3 for Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
Figure 4 for Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
Viaarxiv icon

A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling

Add code
Bookmark button
Alert button
Mar 09, 2022
Yike Zhang, Xiaobing Feng, Yi Liu, Songjun Cao, Long Ma

Figure 1 for A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling
Figure 2 for A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling
Figure 3 for A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling
Figure 4 for A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling
Viaarxiv icon

Improving CTC-based speech recognition via knowledge transferring from pre-trained language models

Add code
Bookmark button
Alert button
Feb 22, 2022
Keqi Deng, Songjun Cao, Yike Zhang, Long Ma, Gaofeng Cheng, Ji Xu, Pengyuan Zhang

Figure 1 for Improving CTC-based speech recognition via knowledge transferring from pre-trained language models
Figure 2 for Improving CTC-based speech recognition via knowledge transferring from pre-trained language models
Figure 3 for Improving CTC-based speech recognition via knowledge transferring from pre-trained language models
Figure 4 for Improving CTC-based speech recognition via knowledge transferring from pre-trained language models
Viaarxiv icon

Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model

Add code
Bookmark button
Alert button
Dec 14, 2021
Keqi Deng, Songjun Cao, Yike Zhang, Long Ma

Figure 1 for Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Figure 2 for Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Figure 3 for Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Figure 4 for Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Viaarxiv icon

Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning

Add code
Bookmark button
Alert button
Sep 15, 2021
Keqi Deng, Songjun Cao, Long Ma

Figure 1 for Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning
Figure 2 for Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning
Figure 3 for Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning
Figure 4 for Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning
Viaarxiv icon

Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning

Add code
Bookmark button
Alert button
Sep 15, 2021
Songjun Cao, Yueteng Kang, Yanzhe Fu, Xiaoshuo Xu, Sining Sun, Yike Zhang, Long Ma

Figure 1 for Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning
Figure 2 for Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning
Figure 3 for Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning
Figure 4 for Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning
Viaarxiv icon

Improving Speech Recognition Accuracy of Local POI Using Geographical Models

Add code
Bookmark button
Alert button
Jul 07, 2021
Songjun Cao, Yike Zhang, Xiaobing Feng, Long Ma

Figure 1 for Improving Speech Recognition Accuracy of Local POI Using Geographical Models
Figure 2 for Improving Speech Recognition Accuracy of Local POI Using Geographical Models
Figure 3 for Improving Speech Recognition Accuracy of Local POI Using Geographical Models
Figure 4 for Improving Speech Recognition Accuracy of Local POI Using Geographical Models
Viaarxiv icon

Multi-head Monotonic Chunkwise Attention For Online Speech Recognition

Add code
Bookmark button
Alert button
May 01, 2020
Baiji Liu, Songjun Cao, Sining Sun, Weibin Zhang, Long Ma

Figure 1 for Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
Figure 2 for Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
Figure 3 for Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
Viaarxiv icon