Vietnamese Datasets


Human-Guided Reasoning with Large Language Models for Vietnamese Speech Emotion Recognition

Apr 02, 2026
Via arXiv

ViCLSR: A Supervised Contrastive Learning Framework with Natural Language Inference for Natural Language Understanding Tasks

Mar 22, 2026
Via arXiv

ViX-Ray: A Vietnamese Chest X-Ray Dataset for Vision-Language Models

Mar 16, 2026
Via arXiv

Vietnamese Automatic Speech Recognition: A Revisit

Mar 16, 2026
Via arXiv

SEAHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Southeast Asia

Mar 17, 2026
Via arXiv

AutoViVQA: A Large-Scale Automatically Constructed Dataset for Vietnamese Visual Question Answering

Mar 11, 2026
Via arXiv

End-to-End Chatbot Evaluation with Adaptive Reasoning and Uncertainty Filtering

Mar 11, 2026
Via arXiv

ViDia2Std: A Parallel Corpus and Methods for Low-Resource Vietnamese Dialect-to-Standard Translation

Mar 10, 2026
Via arXiv

VietJobs: A Vietnamese Job Advertisement Dataset

Mar 05, 2026
Via arXiv

ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark

Feb 13, 2026
Via arXiv