Picture for John H. L. Hansen

John H. L. Hansen

TokenSE: a Mamba-based discrete token speech enhancement framework for cochlear implants

Add code
Apr 14, 2026
Viaarxiv icon

DAT-CFTNet: Speech Enhancement for Cochlear Implant Recipients using Attention-based Dual-Path Recurrent Neural Network

Add code
Apr 08, 2026
Viaarxiv icon

Emotion-Aware Prefix: Towards Explicit Emotion Control in Voice Conversion Models

Add code
Mar 10, 2026
Viaarxiv icon

DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding

Add code
Jun 11, 2025
Viaarxiv icon

UniPET-SPK: A Unified Framework for Parameter-Efficient Tuning of Pre-trained Speech Models for Robust Speaker Verification

Add code
Jan 27, 2025
Viaarxiv icon

DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification

Add code
Jan 09, 2025
Figure 1 for DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification
Figure 2 for DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification
Figure 3 for DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification
Figure 4 for DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification
Viaarxiv icon

Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples

Add code
Aug 23, 2024
Figure 1 for Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
Figure 2 for Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
Figure 3 for Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
Figure 4 for Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
Viaarxiv icon

Navigating the United States Legislative Landscape on Voice Privacy: Existing Laws, Proposed Bills, Protection for Children, and Synthetic Data for AI

Add code
Jul 29, 2024
Figure 1 for Navigating the United States Legislative Landscape on Voice Privacy: Existing Laws, Proposed Bills, Protection for Children, and Synthetic Data for AI
Figure 2 for Navigating the United States Legislative Landscape on Voice Privacy: Existing Laws, Proposed Bills, Protection for Children, and Synthetic Data for AI
Viaarxiv icon

We Need Variations in Speech Synthesis: Sub-center Modelling for Speaker Embeddings

Add code
Jul 05, 2024
Figure 1 for We Need Variations in Speech Synthesis: Sub-center Modelling for Speaker Embeddings
Figure 2 for We Need Variations in Speech Synthesis: Sub-center Modelling for Speaker Embeddings
Figure 3 for We Need Variations in Speech Synthesis: Sub-center Modelling for Speaker Embeddings
Viaarxiv icon

Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification

Add code
Mar 01, 2024
Figure 1 for Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification
Figure 2 for Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification
Figure 3 for Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification
Figure 4 for Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification
Viaarxiv icon