"speech": models, code, and papers

Robust Speaker Recognition Using Speech Enhancement And Attention Model

Jan 14, 2020
Yanpei Shi, Qiang Huang, Thomas Hain

Via arXiv

An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition

Jun 18, 2021
Ruchao Fan, Wei Chu, Peng Chang, Jing Xiao, Abeer Alwan

Via arXiv

Improving EEG based Continuous Speech Recognition

Nov 24, 2019
Gautam Krishna, Co Tran, Mason Carnahan, Yan Han, Ahmed H Tewfik

Via arXiv

Glottal Closure and Opening Instant Detection from Speech Signals

Dec 28, 2019
Thomas Drugman, Thierry Dutoit

Via arXiv

On the Usefulness of Self-Attention for Automatic Speech Recognition with Transformers

Nov 08, 2020
Shucong Zhang, Erfan Loweimi, Peter Bell, Steve Renals

Via arXiv

Parallel Neural Text-to-Speech

May 21, 2019
Kainan Peng, Wei Ping, Zhao Song, Kexin Zhao

Via arXiv

ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting

Jan 15, 2022
Dianwen Ng, Yunqi Chen, Biao Tian, Qiang Fu, Eng Siong Chng

Via arXiv

Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition

May 06, 2022
Yuan Gong, Jin Yu, James Glass

Via arXiv

Pre-training for low resource speech-to-intent applications

Mar 30, 2021
Pu Wang, Hugo Van hamme

Via arXiv

DCCRN+: Channel-wise Subband DCCRN with SNR Estimation for Speech Enhancement

Jun 16, 2021
Shubo Lv, Yanxin Hu, Shimin Zhang, Lei Xie

Via arXiv