Hynek Hermansky

Self-supervised Learning with Speech Modulation Dropout

Mar 22, 2023
Samik Sadhu, Hynek Hermansky

Stabilized training of joint energy-based models and their practical applications

Mar 07, 2023
Martin Sustek, Samik Sadhu, Lukas Burget, Hynek Hermansky, Jesus Villalba, Laureano Moro-Velazquez, Najim Dehak

Blind Signal Dereverberation for Machine Speech Recognition

Sep 30, 2022
Samik Sadhu, Hynek Hermansky

Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives

Mar 31, 2022
Samik Sadhu, Hynek Hermansky

Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech

Mar 31, 2022
Samik Sadhu, Hynek Hermansky

Radically Old Way of Computing Spectra: Applications in End-to-End ASR

Apr 02, 2021
Samik Sadhu, Hynek Hermansky

FDLP-Spectrogram: Capturing Speech Dynamics in Spectrograms for End-to-end Automatic Speech Recognition

Mar 25, 2021
Samik Sadhu, Hynek Hermansky

Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR

Feb 05, 2021
Ruizhi Li, Gregory Sell, Hynek Hermansky

A practical two-stage training strategy for multi-stream end-to-end speech recognition

Oct 23, 2019
Ruizhi Li, Gregory Sell, Xiaofei Wang, Shinji Watanabe, Hynek Hermansky

Multi-Stream End-to-End Speech Recognition

Jun 17, 2019
Ruizhi Li, Xiaofei Wang, Sri Harish Mallidi, Shinji Watanabe, Takaaki Hori, Hynek Hermansky
