Picture for Jean-Marc Valin

Jean-Marc Valin

Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) with Pitch Prediction

Add code
May 31, 2024
Viaarxiv icon

Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure

Add code
Feb 01, 2024
Figure 1 for Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure
Figure 2 for Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure
Figure 3 for Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure
Figure 4 for Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure
Viaarxiv icon

NoLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping

Add code
Sep 25, 2023
Figure 1 for NoLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping
Figure 2 for NoLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping
Figure 3 for NoLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping
Figure 4 for NoLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping
Viaarxiv icon

Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity

Add code
Sep 25, 2023
Figure 1 for Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
Figure 2 for Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
Figure 3 for Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
Figure 4 for Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
Viaarxiv icon

LACE: A light-weight, causal model for enhancing coded speech through adaptive convolutions

Add code
Jul 13, 2023
Figure 1 for LACE: A light-weight, causal model for enhancing coded speech through adaptive convolutions
Figure 2 for LACE: A light-weight, causal model for enhancing coded speech through adaptive convolutions
Figure 3 for LACE: A light-weight, causal model for enhancing coded speech through adaptive convolutions
Figure 4 for LACE: A light-weight, causal model for enhancing coded speech through adaptive convolutions
Viaarxiv icon

A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement

Add code
Feb 23, 2023
Figure 1 for A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Figure 2 for A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Figure 3 for A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Figure 4 for A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Viaarxiv icon

Low-Bitrate Redundancy Coding of Speech Using a Rate-Distortion-Optimized Variational Autoencoder

Add code
Dec 08, 2022
Figure 1 for Low-Bitrate Redundancy Coding of Speech Using a Rate-Distortion-Optimized Variational Autoencoder
Figure 2 for Low-Bitrate Redundancy Coding of Speech Using a Rate-Distortion-Optimized Variational Autoencoder
Figure 3 for Low-Bitrate Redundancy Coding of Speech Using a Rate-Distortion-Optimized Variational Autoencoder
Viaarxiv icon

Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity

Add code
Dec 08, 2022
Figure 1 for Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity
Figure 2 for Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity
Figure 3 for Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity
Viaarxiv icon

Semi-supervised Time Domain Target Speaker Extraction with Attention

Add code
Jun 18, 2022
Figure 1 for Semi-supervised Time Domain Target Speaker Extraction with Attention
Figure 2 for Semi-supervised Time Domain Target Speaker Extraction with Attention
Figure 3 for Semi-supervised Time Domain Target Speaker Extraction with Attention
Figure 4 for Semi-supervised Time Domain Target Speaker Extraction with Attention
Viaarxiv icon

To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets

Add code
Jun 16, 2022
Figure 1 for To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
Figure 2 for To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
Figure 3 for To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
Figure 4 for To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
Viaarxiv icon