Picture for Mengzhe Geng

Mengzhe Geng

Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition

Add code
Jun 14, 2024
Figure 1 for Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
Figure 2 for Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
Figure 3 for Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
Figure 4 for Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
Viaarxiv icon

An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios

Add code
Jun 13, 2024
Viaarxiv icon

Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation

Add code
Jan 01, 2024
Figure 1 for Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation
Figure 2 for Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation
Figure 3 for Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation
Viaarxiv icon

A Survey of Reasoning with Foundation Models

Add code
Dec 26, 2023
Figure 1 for A Survey of Reasoning with Foundation Models
Figure 2 for A Survey of Reasoning with Foundation Models
Figure 3 for A Survey of Reasoning with Foundation Models
Figure 4 for A Survey of Reasoning with Foundation Models
Viaarxiv icon

Towards Automatic Data Augmentation for Disordered Speech Recognition

Add code
Dec 14, 2023
Viaarxiv icon

Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition

Add code
Jul 06, 2023
Figure 1 for Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition
Figure 2 for Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition
Figure 3 for Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition
Figure 4 for Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition
Viaarxiv icon

Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition

Add code
Jun 27, 2023
Viaarxiv icon

Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems

Add code
Jun 26, 2023
Figure 1 for Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems
Figure 2 for Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems
Figure 3 for Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems
Figure 4 for Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems
Viaarxiv icon

Use of Speech Impairment Severity for Dysarthric Speech Recognition

Add code
May 18, 2023
Viaarxiv icon

Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition

Add code
Feb 28, 2023
Figure 1 for Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition
Figure 2 for Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition
Figure 3 for Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition
Figure 4 for Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition
Viaarxiv icon