Picture for Rao Ma

Rao Ma

Learn and Don't Forget: Adding a New Language to ASR Foundation Models

Add code
Jul 09, 2024
Viaarxiv icon

Cross-Lingual Transfer Learning for Speech Translation

Add code
Jul 01, 2024
Figure 1 for Cross-Lingual Transfer Learning for Speech Translation
Figure 2 for Cross-Lingual Transfer Learning for Speech Translation
Figure 3 for Cross-Lingual Transfer Learning for Speech Translation
Figure 4 for Cross-Lingual Transfer Learning for Speech Translation
Viaarxiv icon

Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models

Add code
May 09, 2024
Figure 1 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Figure 2 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Figure 3 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Figure 4 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Viaarxiv icon

Investigating the Emergent Audio Classification Ability of ASR Foundation Models

Add code
Nov 15, 2023
Figure 1 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 2 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 3 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 4 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Viaarxiv icon

Towards End-to-End Spoken Grammatical Error Correction

Add code
Nov 09, 2023
Viaarxiv icon

Zero-shot Audio Topic Reranking using Large Language Models

Add code
Sep 14, 2023
Figure 1 for Zero-shot Audio Topic Reranking using Large Language Models
Figure 2 for Zero-shot Audio Topic Reranking using Large Language Models
Figure 3 for Zero-shot Audio Topic Reranking using Large Language Models
Figure 4 for Zero-shot Audio Topic Reranking using Large Language Models
Viaarxiv icon

Adapting an ASR Foundation Model for Spoken Language Assessment

Add code
Jul 13, 2023
Figure 1 for Adapting an ASR Foundation Model for Spoken Language Assessment
Figure 2 for Adapting an ASR Foundation Model for Spoken Language Assessment
Figure 3 for Adapting an ASR Foundation Model for Spoken Language Assessment
Figure 4 for Adapting an ASR Foundation Model for Spoken Language Assessment
Viaarxiv icon

Can Generative Large Language Models Perform ASR Error Correction?

Add code
Jul 09, 2023
Figure 1 for Can Generative Large Language Models Perform ASR Error Correction?
Figure 2 for Can Generative Large Language Models Perform ASR Error Correction?
Figure 3 for Can Generative Large Language Models Perform ASR Error Correction?
Figure 4 for Can Generative Large Language Models Perform ASR Error Correction?
Viaarxiv icon

Adapting an Unadaptable ASR System

Add code
Jun 01, 2023
Figure 1 for Adapting an Unadaptable ASR System
Figure 2 for Adapting an Unadaptable ASR System
Figure 3 for Adapting an Unadaptable ASR System
Figure 4 for Adapting an Unadaptable ASR System
Viaarxiv icon

N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

Add code
Mar 01, 2023
Figure 1 for N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space
Figure 2 for N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space
Figure 3 for N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space
Figure 4 for N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space
Viaarxiv icon