Vietnamese Datasets


ViMRHP: A Vietnamese Benchmark Dataset for Multimodal Review Helpfulness Prediction via Human-AI Collaborative Annotation

Add code
May 12, 2025
Viaarxiv icon

Validation of a 24-hour-ahead Prediction model for a Residential Electrical Load under diverse climate

Add code
May 01, 2025
Viaarxiv icon

Coreference Resolution for Vietnamese Narrative Texts

Add code
Apr 28, 2025
Viaarxiv icon

GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning

Add code
Apr 23, 2025
Viaarxiv icon

ViQA-COVID: COVID-19 Machine Reading Comprehension Dataset for Vietnamese

Add code
Apr 21, 2025
Viaarxiv icon

Nested Named-Entity Recognition on Vietnamese COVID-19: Dataset and Experiments

Add code
Apr 21, 2025
Viaarxiv icon

MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation

Add code
Apr 04, 2025
Viaarxiv icon

Enhancing Vietnamese VQA through Curriculum Learning on Raw and Augmented Text Representations

Add code
Mar 05, 2025
Viaarxiv icon

LiGT: Layout-infused Generative Transformer for Visual Question Answering on Vietnamese Receipts

Add code
Feb 26, 2025
Viaarxiv icon

A Large-Scale Benchmark for Vietnamese Sentence Paraphrases

Add code
Feb 11, 2025
Viaarxiv icon