Alert button

"speech": models, code, and papers
Alert button

Probabilistic Permutation Invariant Training for Speech Separation

Aug 04, 2019
Midia Yousefi, Soheil Khorram, John H. L. Hansen

Figure 1 for Probabilistic Permutation Invariant Training for Speech Separation
Figure 2 for Probabilistic Permutation Invariant Training for Speech Separation
Figure 3 for Probabilistic Permutation Invariant Training for Speech Separation
Viaarxiv icon

Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge

Add code
Bookmark button
Alert button
Dec 23, 2020
Riza Velioglu, Jewgeni Rose

Figure 1 for Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge
Figure 2 for Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge
Figure 3 for Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge
Figure 4 for Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge
Viaarxiv icon

Challenging the Boundaries of Speech Recognition: The MALACH Corpus

Aug 09, 2019
Michael Picheny, Zóltan Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon

Figure 1 for Challenging the Boundaries of Speech Recognition: The MALACH Corpus
Figure 2 for Challenging the Boundaries of Speech Recognition: The MALACH Corpus
Viaarxiv icon

UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset

Add code
Bookmark button
Alert button
Jul 12, 2021
Chengyi Wang, Yu Wu, Shujie Liu, Jinyu Li, Yao Qian, Kenichi Kumatani, Furu Wei

Figure 1 for UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset
Figure 2 for UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset
Figure 3 for UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset
Figure 4 for UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset
Viaarxiv icon

Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation

Add code
Bookmark button
Alert button
May 25, 2022
Injy Hamed, Nizar Habash, Slim Abdennadher, Ngoc Thang Vu

Figure 1 for Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation
Figure 2 for Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation
Figure 3 for Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation
Figure 4 for Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation
Viaarxiv icon

Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition

Apr 19, 2021
Wei Zhou, Mohammad Zeineldeen, Zuoyun Zheng, Ralf Schlüter, Hermann Ney

Figure 1 for Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition
Figure 2 for Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition
Figure 3 for Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition
Figure 4 for Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition
Viaarxiv icon

A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning

Jun 03, 2020
Sameer Khurana, Antoine Laurent, Wei-Ning Hsu, Jan Chorowski, Adrian Lancucki, Ricard Marxer, James Glass

Figure 1 for A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning
Figure 2 for A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning
Figure 3 for A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning
Viaarxiv icon

Long-span language modeling for speech recognition

Nov 11, 2019
Sarangarajan Parthasarathy, William Gale, Xie Chen, George Polovets, Shuangyu Chang

Figure 1 for Long-span language modeling for speech recognition
Figure 2 for Long-span language modeling for speech recognition
Figure 3 for Long-span language modeling for speech recognition
Figure 4 for Long-span language modeling for speech recognition
Viaarxiv icon

Improving CTC-based ASR Models with Gated Interlayer Collaboration

May 25, 2022
Yuting Yang, Yuke Li, Binbin Du

Figure 1 for Improving CTC-based ASR Models with Gated Interlayer Collaboration
Figure 2 for Improving CTC-based ASR Models with Gated Interlayer Collaboration
Figure 3 for Improving CTC-based ASR Models with Gated Interlayer Collaboration
Figure 4 for Improving CTC-based ASR Models with Gated Interlayer Collaboration
Viaarxiv icon

TuGeBiC: A Turkish German Bilingual Code-Switching Corpus

Add code
Bookmark button
Alert button
May 02, 2022
Jeanine Treffers-Daller and, Ozlem Çetinoğlu

Figure 1 for TuGeBiC: A Turkish German Bilingual Code-Switching Corpus
Viaarxiv icon