Picture for Rao Ma

Rao Ma

Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction

Add code
May 27, 2025
Viaarxiv icon

Assessment of L2 Oral Proficiency using Speech Large Language Models

Add code
May 27, 2025
Viaarxiv icon

Universal Acoustic Adversarial Attacks for Flexible Control of Speech-LLMs

Add code
May 20, 2025
Viaarxiv icon

LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors

Add code
May 16, 2025
Viaarxiv icon

ASR Error Correction using Large Language Models

Add code
Sep 14, 2024
Viaarxiv icon

Learn and Don't Forget: Adding a New Language to ASR Foundation Models

Add code
Jul 09, 2024
Figure 1 for Learn and Don't Forget: Adding a New Language to ASR Foundation Models
Figure 2 for Learn and Don't Forget: Adding a New Language to ASR Foundation Models
Figure 3 for Learn and Don't Forget: Adding a New Language to ASR Foundation Models
Figure 4 for Learn and Don't Forget: Adding a New Language to ASR Foundation Models
Viaarxiv icon

Cross-Lingual Transfer Learning for Speech Translation

Add code
Jul 01, 2024
Figure 1 for Cross-Lingual Transfer Learning for Speech Translation
Figure 2 for Cross-Lingual Transfer Learning for Speech Translation
Figure 3 for Cross-Lingual Transfer Learning for Speech Translation
Figure 4 for Cross-Lingual Transfer Learning for Speech Translation
Viaarxiv icon

Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models

Add code
May 09, 2024
Figure 1 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Figure 2 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Figure 3 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Figure 4 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Viaarxiv icon

Investigating the Emergent Audio Classification Ability of ASR Foundation Models

Add code
Nov 15, 2023
Figure 1 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 2 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 3 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 4 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Viaarxiv icon

Towards End-to-End Spoken Grammatical Error Correction

Add code
Nov 09, 2023
Viaarxiv icon