Picture for Najim Dehak

Najim Dehak

Clean Label Attacks against SLU Systems

Add code
Sep 13, 2024
Figure 1 for Clean Label Attacks against SLU Systems
Figure 2 for Clean Label Attacks against SLU Systems
Figure 3 for Clean Label Attacks against SLU Systems
Figure 4 for Clean Label Attacks against SLU Systems
Viaarxiv icon

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer

Add code
Sep 12, 2024
Viaarxiv icon

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis

Add code
Sep 11, 2024
Figure 1 for SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
Figure 2 for SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
Figure 3 for SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
Figure 4 for SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
Viaarxiv icon

Noise-robust Speech Separation with Fast Generative Correction

Add code
Jun 11, 2024
Viaarxiv icon

Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification

Add code
Feb 29, 2024
Figure 1 for Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
Figure 2 for Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
Figure 3 for Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
Figure 4 for Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
Viaarxiv icon

Time Scale Network: A Shallow Neural Network For Time Series Data

Add code
Nov 10, 2023
Viaarxiv icon

DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction

Add code
Oct 10, 2023
Viaarxiv icon

Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning

Add code
Sep 08, 2023
Figure 1 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Figure 2 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Figure 3 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Figure 4 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Viaarxiv icon

DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model

Add code
Jun 18, 2023
Figure 1 for DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Figure 2 for DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Figure 3 for DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Figure 4 for DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Viaarxiv icon

Regularizing Contrastive Predictive Coding for Speech Applications

Add code
Apr 26, 2023
Figure 1 for Regularizing Contrastive Predictive Coding for Speech Applications
Figure 2 for Regularizing Contrastive Predictive Coding for Speech Applications
Figure 3 for Regularizing Contrastive Predictive Coding for Speech Applications
Figure 4 for Regularizing Contrastive Predictive Coding for Speech Applications
Viaarxiv icon