Vietnamese Speech Data


Human-Guided Reasoning with Large Language Models for Vietnamese Speech Emotion Recognition

Apr 02, 2026
Via arXiv

Vietnamese Automatic Speech Recognition: A Revisit

Mar 16, 2026
Via arXiv

SEAHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Southeast Asia

Mar 17, 2026
Via arXiv

ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition

Jun 05, 2025
Via arXiv

MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation

Apr 04, 2025
Via arXiv

AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR

Jan 13, 2025
Via arXiv

A Big Data-empowered System for Real-time Detection of Regional Discriminatory Comments on Vietnamese Social Media

Oct 29, 2024
Via arXiv

Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges

Oct 04, 2024
Via arXiv

GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement

Jun 17, 2024
Via arXiv

MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder

Sep 21, 2024
Via arXiv