Alert button
Picture for Mikolaj Kegler

Mikolaj Kegler

Alert button

CATSE: A Context-Aware Framework for Causal Target Sound Extraction

Add code
Bookmark button
Alert button
Mar 21, 2024
Shrishail Baligar, Mikolaj Kegler, Bryce Irvin, Marko Stamenovic, Shawn Newsam

Figure 1 for CATSE: A Context-Aware Framework for Causal Target Sound Extraction
Figure 2 for CATSE: A Context-Aware Framework for Causal Target Sound Extraction
Figure 3 for CATSE: A Context-Aware Framework for Causal Target Sound Extraction
Figure 4 for CATSE: A Context-Aware Framework for Causal Target Sound Extraction
Viaarxiv icon

Latent CLAP Loss for Better Foley Sound Synthesis

Add code
Bookmark button
Alert button
Mar 18, 2024
Tornike Karchkhadze, Hassan Salami Kavaki, Mohammad Rasool Izadi, Bryce Irvin, Mikolaj Kegler, Ari Hertz, Shuo Zhang, Marko Stamenovic

Figure 1 for Latent CLAP Loss for Better Foley Sound Synthesis
Figure 2 for Latent CLAP Loss for Better Foley Sound Synthesis
Figure 3 for Latent CLAP Loss for Better Foley Sound Synthesis
Figure 4 for Latent CLAP Loss for Better Foley Sound Synthesis
Viaarxiv icon

Two-Step Knowledge Distillation for Tiny Speech Enhancement

Add code
Bookmark button
Alert button
Sep 15, 2023
Rayan Daod Nathoo, Mikolaj Kegler, Marko Stamenovic

Viaarxiv icon

Self-Supervised Learning for Speech Enhancement through Synthesis

Add code
Bookmark button
Alert button
Nov 04, 2022
Bryce Irvin, Marko Stamenovic, Mikolaj Kegler, Li-Chia Yang

Figure 1 for Self-Supervised Learning for Speech Enhancement through Synthesis
Figure 2 for Self-Supervised Learning for Speech Enhancement through Synthesis
Figure 3 for Self-Supervised Learning for Speech Enhancement through Synthesis
Figure 4 for Self-Supervised Learning for Speech Enhancement through Synthesis
Viaarxiv icon

BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping

Add code
Bookmark button
Alert button
Jun 30, 2022
Gasser Elbanna, Neil Scheidwasser-Clow, Mikolaj Kegler, Pierre Beckmann, Karl El Hajal, Milos Cernak

Figure 1 for BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Figure 2 for BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Figure 3 for BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Figure 4 for BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Viaarxiv icon

Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load

Add code
Bookmark button
Alert button
Mar 30, 2022
Gasser Elbanna, Alice Biryukov, Neil Scheidwasser-Clow, Lara Orlandic, Pablo Mainar, Mikolaj Kegler, Pierre Beckmann, Milos Cernak

Figure 1 for Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load
Figure 2 for Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load
Figure 3 for Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load
Figure 4 for Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load
Viaarxiv icon

SERAB: A multi-lingual benchmark for speech emotion recognition

Add code
Bookmark button
Alert button
Oct 07, 2021
Neil Scheidwasser-Clow, Mikolaj Kegler, Pierre Beckmann, Milos Cernak

Figure 1 for SERAB: A multi-lingual benchmark for speech emotion recognition
Figure 2 for SERAB: A multi-lingual benchmark for speech emotion recognition
Figure 3 for SERAB: A multi-lingual benchmark for speech emotion recognition
Figure 4 for SERAB: A multi-lingual benchmark for speech emotion recognition
Viaarxiv icon

Speech-VGG: A deep feature extractor for speech processing

Add code
Bookmark button
Alert button
Oct 22, 2019
Pierre Beckmann, Mikolaj Kegler, Hugues Saltini, Milos Cernak

Figure 1 for Speech-VGG: A deep feature extractor for speech processing
Figure 2 for Speech-VGG: A deep feature extractor for speech processing
Figure 3 for Speech-VGG: A deep feature extractor for speech processing
Figure 4 for Speech-VGG: A deep feature extractor for speech processing
Viaarxiv icon

Deep speech inpainting of time-frequency masks

Add code
Bookmark button
Alert button
Oct 22, 2019
Mikolaj Kegler, Pierre Beckmann, Milos Cernak

Figure 1 for Deep speech inpainting of time-frequency masks
Figure 2 for Deep speech inpainting of time-frequency masks
Figure 3 for Deep speech inpainting of time-frequency masks
Figure 4 for Deep speech inpainting of time-frequency masks
Viaarxiv icon