Picture for Thomas Thebaud

Thomas Thebaud

LIUM

DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers

Add code
Mar 23, 2026
Viaarxiv icon

Can LLMs Help Localize Fake Words in Partially Fake Speech?

Add code
Mar 11, 2026
Viaarxiv icon

Speaker Verification with Speech-Aware LLMs: Evaluation and Augmentation

Add code
Mar 11, 2026
Viaarxiv icon

Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization

Add code
Dec 17, 2025
Figure 1 for Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization
Figure 2 for Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization
Figure 3 for Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization
Figure 4 for Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization
Viaarxiv icon

Multi-Target Backdoor Attacks Against Speaker Recognition

Add code
Aug 13, 2025
Viaarxiv icon

Enhancing Dialogue Annotation with Speaker Characteristics Leveraging a Frozen LLM

Add code
Aug 06, 2025
Viaarxiv icon

Rhythm Features for Speaker Identification

Add code
Jun 07, 2025
Viaarxiv icon

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Add code
May 25, 2025
Viaarxiv icon

Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits

Add code
May 20, 2025
Viaarxiv icon

Demographic Attributes Prediction from Speech Using WavLM Embeddings

Add code
Feb 17, 2025
Viaarxiv icon