Picture for Cem Subakan

Cem Subakan

Open-Source Conversational AI with SpeechBrain 1.0

Add code
Jul 02, 2024
Figure 1 for Open-Source Conversational AI with SpeechBrain 1.0
Figure 2 for Open-Source Conversational AI with SpeechBrain 1.0
Viaarxiv icon

DASB -- Discrete Audio and Speech Benchmark

Add code
Jun 20, 2024
Figure 1 for DASB -- Discrete Audio and Speech Benchmark
Figure 2 for DASB -- Discrete Audio and Speech Benchmark
Figure 3 for DASB -- Discrete Audio and Speech Benchmark
Figure 4 for DASB -- Discrete Audio and Speech Benchmark
Viaarxiv icon

How Should We Extract Discrete Audio Tokens from Self-Supervised Models?

Add code
Jun 15, 2024
Figure 1 for How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
Figure 2 for How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
Figure 3 for How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
Figure 4 for How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
Viaarxiv icon

Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice

Add code
Jun 14, 2024
Figure 1 for Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice
Figure 2 for Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice
Figure 3 for Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice
Figure 4 for Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice
Viaarxiv icon

Listenable Maps for Zero-Shot Audio Classifiers

Add code
May 27, 2024
Figure 1 for Listenable Maps for Zero-Shot Audio Classifiers
Figure 2 for Listenable Maps for Zero-Shot Audio Classifiers
Figure 3 for Listenable Maps for Zero-Shot Audio Classifiers
Figure 4 for Listenable Maps for Zero-Shot Audio Classifiers
Viaarxiv icon

Listenable Maps for Audio Classifiers

Add code
Mar 19, 2024
Figure 1 for Listenable Maps for Audio Classifiers
Figure 2 for Listenable Maps for Audio Classifiers
Figure 3 for Listenable Maps for Audio Classifiers
Figure 4 for Listenable Maps for Audio Classifiers
Viaarxiv icon

Focal Modulation Networks for Interpretable Sound Classification

Add code
Feb 05, 2024
Viaarxiv icon

CL-MASR: A Continual Learning Benchmark for Multilingual ASR

Add code
Oct 25, 2023
Figure 1 for CL-MASR: A Continual Learning Benchmark for Multilingual ASR
Figure 2 for CL-MASR: A Continual Learning Benchmark for Multilingual ASR
Figure 3 for CL-MASR: A Continual Learning Benchmark for Multilingual ASR
Figure 4 for CL-MASR: A Continual Learning Benchmark for Multilingual ASR
Viaarxiv icon

Audio Editing with Non-Rigid Text Prompts

Add code
Oct 19, 2023
Figure 1 for Audio Editing with Non-Rigid Text Prompts
Figure 2 for Audio Editing with Non-Rigid Text Prompts
Figure 3 for Audio Editing with Non-Rigid Text Prompts
Figure 4 for Audio Editing with Non-Rigid Text Prompts
Viaarxiv icon

CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice

Add code
May 29, 2023
Figure 1 for CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice
Figure 2 for CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice
Figure 3 for CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice
Figure 4 for CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice
Viaarxiv icon