Alert button
Picture for Samik Sadhu

Samik Sadhu

Alert button

Self-supervised Learning with Speech Modulation Dropout

Add code
Bookmark button
Alert button
Mar 22, 2023
Samik Sadhu, Hynek Hermansky

Figure 1 for Self-supervised Learning with Speech Modulation Dropout
Figure 2 for Self-supervised Learning with Speech Modulation Dropout
Figure 3 for Self-supervised Learning with Speech Modulation Dropout
Figure 4 for Self-supervised Learning with Speech Modulation Dropout
Viaarxiv icon

Stabilized training of joint energy-based models and their practical applications

Add code
Bookmark button
Alert button
Mar 07, 2023
Martin Sustek, Samik Sadhu, Lukas Burget, Hynek Hermansky, Jesus Villalba, Laureano Moro-Velazquez, Najim Dehak

Figure 1 for Stabilized training of joint energy-based models and their practical applications
Figure 2 for Stabilized training of joint energy-based models and their practical applications
Figure 3 for Stabilized training of joint energy-based models and their practical applications
Figure 4 for Stabilized training of joint energy-based models and their practical applications
Viaarxiv icon

Blind Signal Dereverberation for Machine Speech Recognition

Add code
Bookmark button
Alert button
Sep 30, 2022
Samik Sadhu, Hynek Hermansky

Figure 1 for Blind Signal Dereverberation for Machine Speech Recognition
Figure 2 for Blind Signal Dereverberation for Machine Speech Recognition
Figure 3 for Blind Signal Dereverberation for Machine Speech Recognition
Figure 4 for Blind Signal Dereverberation for Machine Speech Recognition
Viaarxiv icon

Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives

Add code
Bookmark button
Alert button
Mar 31, 2022
Samik Sadhu, Hynek Hermansky

Figure 1 for Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives
Figure 2 for Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives
Figure 3 for Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives
Figure 4 for Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives
Viaarxiv icon

Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech

Add code
Bookmark button
Alert button
Mar 31, 2022
Samik Sadhu, Hynek Hermansky

Figure 1 for Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech
Figure 2 for Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech
Figure 3 for Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech
Figure 4 for Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech
Viaarxiv icon

Radically Old Way of Computing Spectra: Applications in End-to-End ASR

Add code
Bookmark button
Alert button
Apr 02, 2021
Samik Sadhu, Hynek Hermansky

Figure 1 for Radically Old Way of Computing Spectra: Applications in End-to-End ASR
Figure 2 for Radically Old Way of Computing Spectra: Applications in End-to-End ASR
Figure 3 for Radically Old Way of Computing Spectra: Applications in End-to-End ASR
Figure 4 for Radically Old Way of Computing Spectra: Applications in End-to-End ASR
Viaarxiv icon

FDLP-Spectrogram: Capturing Speech Dynamics in Spectrograms for End-to-end Automatic Speech Recognition

Add code
Bookmark button
Alert button
Mar 25, 2021
Samik Sadhu, Hynek Hermansky

Figure 1 for FDLP-Spectrogram: Capturing Speech Dynamics in Spectrograms for End-to-end Automatic Speech Recognition
Figure 2 for FDLP-Spectrogram: Capturing Speech Dynamics in Spectrograms for End-to-end Automatic Speech Recognition
Figure 3 for FDLP-Spectrogram: Capturing Speech Dynamics in Spectrograms for End-to-end Automatic Speech Recognition
Figure 4 for FDLP-Spectrogram: Capturing Speech Dynamics in Spectrograms for End-to-end Automatic Speech Recognition
Viaarxiv icon

Wav2vec-C: A Self-supervised Model for Speech Representation Learning

Add code
Bookmark button
Alert button
Mar 09, 2021
Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas

Figure 1 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Figure 2 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Figure 3 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Figure 4 for Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Viaarxiv icon

Exploring Methods for the Automatic Detection of Errors in Manual Transcription

Add code
Bookmark button
Alert button
Apr 08, 2019
Xiaofei Wang, Jinyi Yang, Ruizhi Li, Samik Sadhu, Hynek Hermansky

Figure 1 for Exploring Methods for the Automatic Detection of Errors in Manual Transcription
Figure 2 for Exploring Methods for the Automatic Detection of Errors in Manual Transcription
Figure 3 for Exploring Methods for the Automatic Detection of Errors in Manual Transcription
Figure 4 for Exploring Methods for the Automatic Detection of Errors in Manual Transcription
Viaarxiv icon