Alert button
Picture for Karim Helwani

Karim Helwani

Alert button

Sound Source Separation Using Latent Variational Block-Wise Disentanglement

Add code
Bookmark button
Alert button
Feb 08, 2024
Karim Helwani, Masahito Togami, Paris Smaragdis, Michael M. Goodwin

Viaarxiv icon

Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure

Add code
Bookmark button
Alert button
Feb 01, 2024
Masahito Togami, Jean-Marc Valin, Karim Helwani, Ritwik Giri, Umut Isik, Michael M. Goodwin

Viaarxiv icon

Neural Harmonium: An Interpretable Deep Structure for Nonlinear Dynamic System Identification with Application to Audio Processing

Add code
Bookmark button
Alert button
Oct 10, 2023
Karim Helwani, Erfan Soltanmohammadi, Michael M. Goodwin

Figure 1 for Neural Harmonium: An Interpretable Deep Structure for Nonlinear Dynamic System Identification with Application to Audio Processing
Figure 2 for Neural Harmonium: An Interpretable Deep Structure for Nonlinear Dynamic System Identification with Application to Audio Processing
Figure 3 for Neural Harmonium: An Interpretable Deep Structure for Nonlinear Dynamic System Identification with Application to Audio Processing
Figure 4 for Neural Harmonium: An Interpretable Deep Structure for Nonlinear Dynamic System Identification with Application to Audio Processing
Viaarxiv icon

NoLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping

Add code
Bookmark button
Alert button
Sep 25, 2023
Jan Büthe, Ahmed Mustafa, Jean-Marc Valin, Karim Helwani, Michael M. Goodwin

Viaarxiv icon

Learning Linear Groups in Neural Networks

Add code
Bookmark button
Alert button
May 29, 2023
Emmanouil Theodosis, Karim Helwani, Demba Ba

Figure 1 for Learning Linear Groups in Neural Networks
Figure 2 for Learning Linear Groups in Neural Networks
Figure 3 for Learning Linear Groups in Neural Networks
Figure 4 for Learning Linear Groups in Neural Networks
Viaarxiv icon

Robust Audio Anomaly Detection

Add code
Bookmark button
Alert button
Feb 03, 2022
Wo Jae Lee, Karim Helwani, Arvindh Krishnaswamy, Srikanth Tenneti

Figure 1 for Robust Audio Anomaly Detection
Figure 2 for Robust Audio Anomaly Detection
Figure 3 for Robust Audio Anomaly Detection
Figure 4 for Robust Audio Anomaly Detection
Viaarxiv icon

Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On PercepNet

Add code
Bookmark button
Alert button
Feb 10, 2021
Jean-Marc Valin, Srikanth Tenneti, Karim Helwani, Umut Isik, Arvindh Krishnaswamy

Figure 1 for Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On PercepNet
Figure 2 for Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On PercepNet
Figure 3 for Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On PercepNet
Figure 4 for Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On PercepNet
Viaarxiv icon

Enhancing Audio Augmentation Methods with Consistency Learning

Add code
Bookmark button
Alert button
Feb 09, 2021
Turab Iqbal, Karim Helwani, Arvindh Krishnaswamy, Wenwu Wang

Figure 1 for Enhancing Audio Augmentation Methods with Consistency Learning
Figure 2 for Enhancing Audio Augmentation Methods with Consistency Learning
Viaarxiv icon

PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss

Add code
Bookmark button
Alert button
Aug 11, 2020
Umut Isik, Ritwik Giri, Neerad Phansalkar, Jean-Marc Valin, Karim Helwani, Arvindh Krishnaswamy

Figure 1 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Figure 2 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Figure 3 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Figure 4 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Viaarxiv icon