Picture for Laurent Girin

Laurent Girin

GIPSA-CRISSP, PERCEPTION

Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting

Add code
May 30, 2024
Figure 1 for Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting
Figure 2 for Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting
Figure 3 for Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting
Figure 4 for Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting
Viaarxiv icon

Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation

Add code
Dec 07, 2023
Figure 1 for Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation
Figure 2 for Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation
Figure 3 for Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation
Figure 4 for Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation
Viaarxiv icon

Unsupervised speech enhancement with deep dynamical generative speech and noise models

Add code
Jun 13, 2023
Figure 1 for Unsupervised speech enhancement with deep dynamical generative speech and noise models
Figure 2 for Unsupervised speech enhancement with deep dynamical generative speech and noise models
Viaarxiv icon

A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning

Add code
May 05, 2023
Viaarxiv icon

Speech Modeling with a Hierarchical Transformer Dynamical VAE

Add code
Mar 07, 2023
Figure 1 for Speech Modeling with a Hierarchical Transformer Dynamical VAE
Figure 2 for Speech Modeling with a Hierarchical Transformer Dynamical VAE
Figure 3 for Speech Modeling with a Hierarchical Transformer Dynamical VAE
Figure 4 for Speech Modeling with a Hierarchical Transformer Dynamical VAE
Viaarxiv icon

BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model

Add code
Jul 04, 2022
Figure 1 for BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model
Figure 2 for BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model
Figure 3 for BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model
Figure 4 for BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model
Viaarxiv icon

Learning and controlling the source-filter representation of speech with a variational autoencoder

Add code
Apr 14, 2022
Figure 1 for Learning and controlling the source-filter representation of speech with a variational autoencoder
Figure 2 for Learning and controlling the source-filter representation of speech with a variational autoencoder
Figure 3 for Learning and controlling the source-filter representation of speech with a variational autoencoder
Figure 4 for Learning and controlling the source-filter representation of speech with a variational autoencoder
Viaarxiv icon

Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation

Add code
Apr 05, 2022
Figure 1 for Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation
Figure 2 for Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation
Figure 3 for Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation
Figure 4 for Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation
Viaarxiv icon

Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder

Add code
Feb 21, 2022
Figure 1 for Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder
Figure 2 for Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder
Figure 3 for Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder
Figure 4 for Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder
Viaarxiv icon

A Survey of Sound Source Localization with Deep Learning Methods

Add code
Sep 16, 2021
Figure 1 for A Survey of Sound Source Localization with Deep Learning Methods
Figure 2 for A Survey of Sound Source Localization with Deep Learning Methods
Figure 3 for A Survey of Sound Source Localization with Deep Learning Methods
Figure 4 for A Survey of Sound Source Localization with Deep Learning Methods
Viaarxiv icon