Picture for Milos Cernak

Milos Cernak

DeepFilterGAN: A Full-band Real-time Speech Enhancement System with GAN-based Stochastic Regeneration

Add code
May 29, 2025
Viaarxiv icon

Model as Loss: A Self-Consistent Training Paradigm

Add code
May 27, 2025
Viaarxiv icon

Semi-intrusive audio evaluation: Casting non-intrusive assessment as a multi-modal text prediction task

Add code
Sep 21, 2024
Viaarxiv icon

OpenACE: An Open Benchmark for Evaluating Audio Coding Performance

Add code
Sep 12, 2024
Figure 1 for OpenACE: An Open Benchmark for Evaluating Audio Coding Performance
Figure 2 for OpenACE: An Open Benchmark for Evaluating Audio Coding Performance
Figure 3 for OpenACE: An Open Benchmark for Evaluating Audio Coding Performance
Figure 4 for OpenACE: An Open Benchmark for Evaluating Audio Coding Performance
Viaarxiv icon

Diffusion-based Speech Enhancement with Schrödinger Bridge and Symmetric Noise Schedule

Add code
Sep 08, 2024
Viaarxiv icon

On real-time multi-stage speech enhancement systems

Add code
Dec 19, 2023
Viaarxiv icon

Multi-Channel MOSRA: Mean Opinion Score and Room Acoustics Estimation Using Simulated Data and a Teacher Model

Add code
Sep 21, 2023
Viaarxiv icon

Cluster-based pruning techniques for audio data

Add code
Sep 21, 2023
Viaarxiv icon

In-Ear-Voice: Towards Milli-Watt Audio Enhancement With Bone-Conduction Microphones for In-Ear Sensing Platforms

Add code
Sep 05, 2023
Viaarxiv icon

Speaker Embeddings as Individuality Proxy for Voice Stress Detection

Add code
Jun 09, 2023
Figure 1 for Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Figure 2 for Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Figure 3 for Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Figure 4 for Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Viaarxiv icon