Steven Krenn

EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation

Jun 11, 2024
Via arXiv

ScoreDec: A Phase-preserving High-Fidelity Audio Codec with A Generalized Score-based Diffusion Post-filter

Jan 22, 2024
Via arXiv

Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio

Nov 01, 2023
Via arXiv

Multiface: A Dataset for Neural Face Rendering

Jul 22, 2022
Via arXiv

Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis

Mar 31, 2022
Via arXiv