Marc Delcroix

Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters

Jan 10, 2024
Via arXiv

BLSTM-Based Confidence Estimation for End-to-End Speech Recognition

Dec 22, 2023
Via arXiv

Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models

Dec 20, 2023
Via arXiv

Neural network-based virtual microphone estimation with virtual microphone and beamformer-level multi-task loss

Nov 20, 2023
Via arXiv

How does end-to-end speech recognition training impact speech enhancement artifacts?

Nov 20, 2023
Via arXiv

Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition

Oct 17, 2023
Via arXiv

Discriminative Training of VBx Diarization

Oct 04, 2023
Via arXiv

Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization

Sep 28, 2023
Via arXiv

NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization

Sep 22, 2023
Via arXiv

Target Speech Extraction with Conditional Diffusion Model

Aug 17, 2023
Via arXiv