"speech recognition": models, code, and papers

From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition

Oct 11, 2019
Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer

Joint Encoder-Decoder Self-Supervised Pre-training for ASR

Jun 09, 2022
Arunkumar A, Umesh S

Almost Unsupervised Text to Speech and Automatic Speech Recognition

May 13, 2019
Yi Ren, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu

Wav2Letter: an End-to-End ConvNet-based Speech Recognition System

Sep 13, 2016
Ronan Collobert, Christian Puhrsch, Gabriel Synnaeve

SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning

Oct 16, 2022
Tzu-hsun Feng, Annie Dong, Ching-Feng Yeh, Shu-wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee

Very Deep Self-Attention Networks for End-to-End Speech Recognition

May 03, 2019
Ngoc-Quan Pham, Thai-Son Nguyen, Jan Niehues, Markus Müller, Sebastian Stüker, Alexander Waibel

FLUTE: A Scalable, Extensible Framework for High-Performance Federated Learning Simulations

Mar 25, 2022
Dimitrios Dimitriadis, Mirian Hipolito Garcia, Daniel Madrigal Diaz, Andre Manoel, Robert Sim

Learning Noise-Invariant Representations for Robust Speech Recognition

Jul 17, 2018
Davis Liang, Zhiheng Huang, Zachary C. Lipton

Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation

May 25, 2022
Injy Hamed, Nizar Habash, Slim Abdennadher, Ngoc Thang Vu
