Picture for George Close

George Close

ZONOS2 Technical Report

Add code
Jun 23, 2026
Viaarxiv icon

Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement

Add code
Jul 18, 2024
Figure 1 for Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement
Figure 2 for Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement
Figure 3 for Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement
Figure 4 for Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement
Viaarxiv icon

Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition

Add code
Jun 13, 2024
Figure 1 for Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition
Figure 2 for Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition
Figure 3 for Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition
Figure 4 for Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition
Viaarxiv icon

Hallucination in Perceptual Metric-Driven Speech Enhancement Networks

Add code
Mar 18, 2024
Figure 1 for Hallucination in Perceptual Metric-Driven Speech Enhancement Networks
Figure 2 for Hallucination in Perceptual Metric-Driven Speech Enhancement Networks
Figure 3 for Hallucination in Perceptual Metric-Driven Speech Enhancement Networks
Figure 4 for Hallucination in Perceptual Metric-Driven Speech Enhancement Networks
Viaarxiv icon

Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models

Add code
Jan 24, 2024
Figure 1 for Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models
Figure 2 for Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models
Figure 3 for Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models
Figure 4 for Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models
Viaarxiv icon

Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement

Add code
Dec 14, 2023
Figure 1 for Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement
Figure 2 for Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement
Viaarxiv icon

Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations

Add code
Jul 27, 2023
Figure 1 for Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations
Figure 2 for Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations
Figure 3 for Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations
Figure 4 for Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations
Viaarxiv icon

The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions

Add code
Jul 27, 2023
Figure 1 for The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions
Figure 2 for The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions
Figure 3 for The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions
Viaarxiv icon

Perceive and predict: self-supervised speech representation based loss functions for speech enhancement

Add code
Jan 11, 2023
Figure 1 for Perceive and predict: self-supervised speech representation based loss functions for speech enhancement
Figure 2 for Perceive and predict: self-supervised speech representation based loss functions for speech enhancement
Figure 3 for Perceive and predict: self-supervised speech representation based loss functions for speech enhancement
Figure 4 for Perceive and predict: self-supervised speech representation based loss functions for speech enhancement
Viaarxiv icon

MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data

Add code
Mar 29, 2022
Figure 1 for MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data
Figure 2 for MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data
Figure 3 for MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data
Figure 4 for MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data
Viaarxiv icon