Rama Doddipatla

Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer

Jul 29, 2022

Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition

May 09, 2022

On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training

May 03, 2022

Dialogue Strategy Adaptation to New Action Sets Using Multi-dimensional Modelling

Apr 14, 2022

Transformer-based Streaming ASR with Cumulative Attention

Mar 11, 2022

A study on cross-corpus speech emotion recognition and data augmentation

Jan 10, 2022

Monaural source separation: From anechoic to reverberant environments

Nov 15, 2021

Towards Handling Unconstrained User Preferences in Dialogue

Sep 17, 2021

Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation

Jun 16, 2021

Head-synchronous Decoding for Transformer-based Streaming ASR

Apr 26, 2021