Picture for Hynek Hermansky

Hynek Hermansky

Self-supervised Learning with Speech Modulation Dropout

Mar 22, 2023
Figure 1 for Self-supervised Learning with Speech Modulation Dropout
Figure 2 for Self-supervised Learning with Speech Modulation Dropout
Figure 3 for Self-supervised Learning with Speech Modulation Dropout
Figure 4 for Self-supervised Learning with Speech Modulation Dropout
Viaarxiv icon

Stabilized training of joint energy-based models and their practical applications

Mar 07, 2023
Figure 1 for Stabilized training of joint energy-based models and their practical applications
Figure 2 for Stabilized training of joint energy-based models and their practical applications
Figure 3 for Stabilized training of joint energy-based models and their practical applications
Figure 4 for Stabilized training of joint energy-based models and their practical applications
Viaarxiv icon

Blind Signal Dereverberation for Machine Speech Recognition

Sep 30, 2022
Figure 1 for Blind Signal Dereverberation for Machine Speech Recognition
Figure 2 for Blind Signal Dereverberation for Machine Speech Recognition
Figure 3 for Blind Signal Dereverberation for Machine Speech Recognition
Figure 4 for Blind Signal Dereverberation for Machine Speech Recognition
Viaarxiv icon

Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives

Add code
Mar 31, 2022
Figure 1 for Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives
Figure 2 for Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives
Figure 3 for Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives
Figure 4 for Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives
Viaarxiv icon

Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech

Add code
Mar 31, 2022
Figure 1 for Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech
Figure 2 for Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech
Figure 3 for Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech
Figure 4 for Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech
Viaarxiv icon

Radically Old Way of Computing Spectra: Applications in End-to-End ASR

Add code
Apr 02, 2021
Figure 1 for Radically Old Way of Computing Spectra: Applications in End-to-End ASR
Figure 2 for Radically Old Way of Computing Spectra: Applications in End-to-End ASR
Figure 3 for Radically Old Way of Computing Spectra: Applications in End-to-End ASR
Figure 4 for Radically Old Way of Computing Spectra: Applications in End-to-End ASR
Viaarxiv icon

Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR

Feb 05, 2021
Figure 1 for Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR
Figure 2 for Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR
Figure 3 for Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR
Figure 4 for Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR
Viaarxiv icon

A practical two-stage training strategy for multi-stream end-to-end speech recognition

Oct 23, 2019
Figure 1 for A practical two-stage training strategy for multi-stream end-to-end speech recognition
Figure 2 for A practical two-stage training strategy for multi-stream end-to-end speech recognition
Figure 3 for A practical two-stage training strategy for multi-stream end-to-end speech recognition
Figure 4 for A practical two-stage training strategy for multi-stream end-to-end speech recognition
Viaarxiv icon

Multi-Stream End-to-End Speech Recognition

Jun 17, 2019
Figure 1 for Multi-Stream End-to-End Speech Recognition
Figure 2 for Multi-Stream End-to-End Speech Recognition
Figure 3 for Multi-Stream End-to-End Speech Recognition
Figure 4 for Multi-Stream End-to-End Speech Recognition
Viaarxiv icon

Performance Monitoring for End-to-End Speech Recognition

Apr 09, 2019
Figure 1 for Performance Monitoring for End-to-End Speech Recognition
Figure 2 for Performance Monitoring for End-to-End Speech Recognition
Figure 3 for Performance Monitoring for End-to-End Speech Recognition
Figure 4 for Performance Monitoring for End-to-End Speech Recognition
Viaarxiv icon