Thomas Hain

Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement

Jul 18, 2024
Via arXiv

Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis

Jul 04, 2024
Via arXiv

Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition

Jun 13, 2024
Via arXiv

LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks

Jun 13, 2024
Via arXiv

EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

Jun 11, 2024
Via arXiv

1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem

May 30, 2024
Via arXiv

Automatic Speech Recognition System-Independent Word Error Rate Estimation

Apr 26, 2024
Via arXiv

Hallucination in Perceptual Metric-Driven Speech Enhancement Networks

Mar 18, 2024
Via arXiv

Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations

Mar 13, 2024
Via arXiv

SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations

Mar 10, 2024
Via arXiv