Ritwik Giri

Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure

Feb 01, 2024
Masahito Togami, Jean-Marc Valin, Karim Helwani, Ritwik Giri, Umut Isik, Michael M. Goodwin

Via arXiv

A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement

Feb 23, 2023
Zhepei Wang, Ritwik Giri, Devansh Shah, Jean-Marc Valin, Michael M. Goodwin, Paris Smaragdis

Via arXiv

Semi-supervised Time Domain Target Speaker Extraction with Attention

Jun 18, 2022
Zhepei Wang, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Jean-Marc Valin, Paris Smaragdis, Mike Goodwin, Arvindh Krishnaswamy

Via arXiv

To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets

Jun 16, 2022
Jean-Marc Valin, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Arvindh Krishnaswamy

Via arXiv

Improved singing voice separation with chromagram-based pitch-aware remixing

Mar 28, 2022
Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy

Via arXiv

Personalized PercepNet: Real-time, Low-complexity Target Voice Separation and Enhancement

Jun 08, 2021
Ritwik Giri, Shrikant Venkataramani, Jean-Marc Valin, Umut Isik, Arvindh Krishnaswamy

Via arXiv

Semi-Supervised Singing Voice Separation with Noisy Self-Training

Feb 16, 2021
Zhepei Wang, Ritwik Giri, Umut Isik, Jean-Marc Valin, Arvindh Krishnaswamy

Via arXiv

Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders

Feb 12, 2021
Jonah Casebeer, Vinjai Vale, Umut Isik, Jean-Marc Valin, Ritwik Giri, Arvindh Krishnaswamy

Via arXiv

PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss

Aug 11, 2020
Umut Isik, Ritwik Giri, Neerad Phansalkar, Jean-Marc Valin, Karim Helwani, Arvindh Krishnaswamy

Via arXiv

From Speech-to-Speech Translation to Automatic Dubbing

Feb 02, 2020
Marcello Federico, Robert Enyedi, Roberto Barra-Chicote, Ritwik Giri, Umut Isik, Arvindh Krishnaswamy, Hassan Sawaf

Via arXiv