John H. L. Hansen

We Need Variations in Speech Synthesis: Sub-center Modelling for Speaker Embeddings

Jul 05, 2024
Via arXiv

Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification

Mar 01, 2024
Via arXiv

Multi-objective Non-intrusive Hearing-aid Speech Assessment Model

Nov 15, 2023
Via arXiv

MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition

Oct 27, 2023
Via arXiv

Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition

Oct 17, 2023
Via arXiv

What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model

Jun 10, 2023
Via arXiv

Improving Transformer-based Networks With Locality For Automatic Speaker Verification

Feb 28, 2023
Via arXiv

Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation

Nov 22, 2022
Via arXiv

Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise

Nov 19, 2022
Via arXiv

Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning

Nov 17, 2022
Via arXiv