"speech recognition": models, code, and papers

Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning

Oct 27, 2020
Dongwei Jiang, Wubo Li, Miao Cao, Ruixiong Zhang, Wei Zou, Kun Han, Xiangang Li

Via arXiv

Transfer Learning and SpecAugment applied to SSVEP Based BCI Classification

Oct 08, 2020
Pedro R. A. S. Bassi, Willian Rampazzo, Romis Attux

Via arXiv

Multilingual and Cross-Lingual Intent Detection from Spoken Data

Apr 17, 2021
Daniela Gerz, Pei-Hao Su, Razvan Kusztos, Avishek Mondal, Michał Lis, Eshan Singhal, Nikola Mrkšić, Tsung-Hsien Wen, Ivan Vulić

Via arXiv

ASR is all you need: cross-modal distillation for lip reading

Nov 28, 2019
Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman

Via arXiv

Task-aware Warping Factors in Mask-based Speech Enhancement

Aug 27, 2021
Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka, Koji Okabe, Hitoshi Yamamoto

Via arXiv

Post-Editing Error Correction Algorithm for Speech Recognition using Bing Spelling Suggestion

Mar 23, 2012
Youssef Bassil, Mohammad Alwani

Via arXiv

A Configurable Multilingual Model is All You Need to Recognize All Languages

Jul 13, 2021
Long Zhou, Jinyu Li, Eric Sun, Shujie Liu

Via arXiv

Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-temporal Sparsity

Aug 04, 2021
Chang Gao, Tobi Delbruck, Shih-Chii Liu

Via arXiv

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Jun 16, 2021
Gaurav Menghani

Via arXiv

Equivalence of Segmental and Neural Transducer Modeling: A Proof of Concept

Apr 13, 2021
Wei Zhou, Albert Zeyer, André Merboldt, Ralf Schlüter, Hermann Ney

Via arXiv