Dhananjaya Gowda

Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech

Jan 19, 2024

On the compression of shallow non-causal ASR models using knowledge distillation and tied-and-reduced decoder for low-latency on-device speech recognition

Dec 15, 2023

Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals

Aug 31, 2023

Refining a Deep Learning-based Formant Tracker using Linear Prediction Methods

Aug 17, 2023

Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction

Aug 16, 2023

Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition

Oct 01, 2022

Two-Pass End-to-End ASR Model Compression

Jan 08, 2022

Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks

Jan 05, 2022

Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages

Nov 19, 2021

A comparison of streaming models and data augmentation methods for robust speech recognition

Nov 19, 2021