Picture for Gustav Eje Henter

Gustav Eje Henter

The Voice Behind the Words: Quantifying Intersectional Bias in SpeechLLMs

Add code
Mar 15, 2026
Viaarxiv icon

VoXtream2: Full-stream TTS with dynamic speaking rate control

Add code
Mar 13, 2026
Viaarxiv icon

Speak Your Mind: The Speech Continuation Task as a Probe of Voice-Based Model Bias

Add code
Sep 26, 2025
Viaarxiv icon

VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency

Add code
Sep 19, 2025
Viaarxiv icon

EmojiVoice: Towards long-term controllable expressivity in robot speech

Add code
Jun 18, 2025
Viaarxiv icon

CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models

Add code
Dec 23, 2024
Figure 1 for CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models
Figure 2 for CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models
Figure 3 for CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models
Figure 4 for CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models
Viaarxiv icon

Towards a GENEA Leaderboard -- an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis

Add code
Oct 08, 2024
Figure 1 for Towards a GENEA Leaderboard -- an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis
Figure 2 for Towards a GENEA Leaderboard -- an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis
Figure 3 for Towards a GENEA Leaderboard -- an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis
Figure 4 for Towards a GENEA Leaderboard -- an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis
Viaarxiv icon

Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark Framework

Add code
Jun 12, 2024
Viaarxiv icon

Should you use a probabilistic duration model in TTS? Probably! Especially for spontaneous speech

Add code
Jun 08, 2024
Viaarxiv icon

Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis

Add code
Apr 30, 2024
Viaarxiv icon