"speech": models, code, and papers

Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks

Oct 25, 2019
Alexandros Kastanos, Anton Ragni, Mark Gales

Via arXiv

Emotion Recognition in Speech using Cross-Modal Transfer in the Wild

Aug 16, 2018
Samuel Albanie, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman

Via arXiv

Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech

Jul 28, 2018
Emre Yılmaz, Henk van den Heuvel, David A. van Leeuwen

Via arXiv

Generating Rich Product Descriptions for Conversational E-commerce Systems

Nov 30, 2021
Shashank Kedia, Aditya Mantha, Sneha Gupta, Stephen Guo, Kannan Achan

Via arXiv

Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks

Sep 27, 2020
Gašper Beguš

Via arXiv

Effectiveness of self-supervised pre-training for speech recognition

Nov 10, 2019
Alexei Baevski, Michael Auli, Abdelrahman Mohamed

Via arXiv

Detecting Anomalies within Time Series using Local Neural Transformations

Feb 08, 2022
Tim Schneider, Chen Qiu, Marius Kloft, Decky Aspandi Latif, Steffen Staab, Stephan Mandt, Maja Rudolph

Via arXiv

Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features

Jul 23, 2019
Cunhang Fan, Bin Liu, Jianhua Tao, Jiangyan Yi, Zhengqi Wen

Via arXiv

Flavored Tacotron: Conditional Learning for Prosodic-linguistic Features

Apr 08, 2021
Mahsa Elyasi, Gaurav Bharaj

Via arXiv

Calibrated Learning to Defer with One-vs-All Classifiers

Feb 08, 2022
Rajeev Verma, Eric Nalisnick

Via arXiv