Picture for Milos Cernak

Milos Cernak

Speaker Embeddings as Individuality Proxy for Voice Stress Detection

Add code
Jun 09, 2023
Figure 1 for Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Figure 2 for Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Figure 3 for Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Figure 4 for Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Viaarxiv icon

ALO-VC: Any-to-any Low-latency One-shot Voice Conversion

Add code
Jun 01, 2023
Figure 1 for ALO-VC: Any-to-any Low-latency One-shot Voice Conversion
Figure 2 for ALO-VC: Any-to-any Low-latency One-shot Voice Conversion
Figure 3 for ALO-VC: Any-to-any Low-latency One-shot Voice Conversion
Figure 4 for ALO-VC: Any-to-any Low-latency One-shot Voice Conversion
Viaarxiv icon

BC-VAD: A Robust Bone Conduction Voice Activity Detection

Add code
Dec 06, 2022
Viaarxiv icon

Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings

Add code
Nov 12, 2022
Figure 1 for Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings
Figure 2 for Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings
Figure 3 for Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings
Figure 4 for Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings
Viaarxiv icon

BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping

Add code
Jun 30, 2022
Figure 1 for BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Figure 2 for BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Figure 3 for BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Figure 4 for BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Viaarxiv icon

MOSRA: Joint Mean Opinion Score and Room Acoustics Speech Quality Assessment

Add code
Apr 04, 2022
Figure 1 for MOSRA: Joint Mean Opinion Score and Room Acoustics Speech Quality Assessment
Figure 2 for MOSRA: Joint Mean Opinion Score and Room Acoustics Speech Quality Assessment
Figure 3 for MOSRA: Joint Mean Opinion Score and Room Acoustics Speech Quality Assessment
Figure 4 for MOSRA: Joint Mean Opinion Score and Room Acoustics Speech Quality Assessment
Viaarxiv icon

Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load

Add code
Mar 30, 2022
Figure 1 for Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load
Figure 2 for Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load
Figure 3 for Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load
Figure 4 for Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load
Viaarxiv icon

AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion

Add code
Nov 12, 2021
Figure 1 for AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion
Figure 2 for AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion
Viaarxiv icon

Power efficient analog features for audio recognition

Add code
Oct 07, 2021
Figure 1 for Power efficient analog features for audio recognition
Figure 2 for Power efficient analog features for audio recognition
Figure 3 for Power efficient analog features for audio recognition
Figure 4 for Power efficient analog features for audio recognition
Viaarxiv icon

SERAB: A multi-lingual benchmark for speech emotion recognition

Add code
Oct 07, 2021
Figure 1 for SERAB: A multi-lingual benchmark for speech emotion recognition
Figure 2 for SERAB: A multi-lingual benchmark for speech emotion recognition
Figure 3 for SERAB: A multi-lingual benchmark for speech emotion recognition
Figure 4 for SERAB: A multi-lingual benchmark for speech emotion recognition
Viaarxiv icon