Picture for Marc Delcroix

Marc Delcroix

Lightweight Zero-shot Text-to-Speech with Mixture of Adapters

Add code
Jul 01, 2024
Viaarxiv icon

SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling

Add code
Jul 01, 2024
Figure 1 for SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Figure 2 for SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Figure 3 for SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Viaarxiv icon

Applying LLMs for Rescoring N-best ASR Hypotheses of Casual Conversations: Effects of Domain Adaptation and Context Carry-over

Add code
Jun 27, 2024
Viaarxiv icon

Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance

Add code
Apr 23, 2024
Figure 1 for Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
Figure 2 for Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
Figure 3 for Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
Figure 4 for Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
Viaarxiv icon

Target Speech Extraction with Pre-trained Self-supervised Learning Models

Add code
Feb 17, 2024
Figure 1 for Target Speech Extraction with Pre-trained Self-supervised Learning Models
Figure 2 for Target Speech Extraction with Pre-trained Self-supervised Learning Models
Figure 3 for Target Speech Extraction with Pre-trained Self-supervised Learning Models
Figure 4 for Target Speech Extraction with Pre-trained Self-supervised Learning Models
Viaarxiv icon

Probing Self-supervised Learning Models with Target Speech Extraction

Add code
Feb 17, 2024
Viaarxiv icon

Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers

Add code
Feb 05, 2024
Figure 1 for Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
Figure 2 for Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
Figure 3 for Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
Figure 4 for Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
Viaarxiv icon

What Do Self-Supervised Speech and Speaker Models Learn? New Findings From a Cross Model Layer-Wise Analysis

Add code
Jan 31, 2024
Viaarxiv icon

Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters

Add code
Jan 10, 2024
Figure 1 for Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
Figure 2 for Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
Figure 3 for Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
Figure 4 for Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
Viaarxiv icon

BLSTM-Based Confidence Estimation for End-to-End Speech Recognition

Add code
Dec 22, 2023
Viaarxiv icon