Flores


Model-Based Quality Assessment for Massively Multilingual Parallel Data

Add code
May 29, 2026
Viaarxiv icon

HardMTBench: Stress-Testing Chinese-English Translation on Knowledge-Intensive Domains

Add code
May 27, 2026
Viaarxiv icon

Sense Representations Are Inducible Interfaces

Add code
May 27, 2026
Viaarxiv icon

DFKI-MLT at SemEval-2026 TASK 7: Steering Multilingual Models Towards Cultural Knowledge

Add code
May 21, 2026
Viaarxiv icon

SomaliWeb v1: A Quality-Filtered Somali Web Corpus with a Matched Tokenizer and a Public Language-Identification Benchmark

Add code
May 18, 2026
Viaarxiv icon

Context Training with Active Information Seeking

Add code
May 14, 2026
Viaarxiv icon

An Empirical Study of Many-Shot In-Context Learning for Machine Translation of Low-Resource Languages

Add code
Apr 03, 2026
Viaarxiv icon

VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models

Add code
Mar 19, 2026
Viaarxiv icon

Omnilingual SONAR: Cross-Lingual and Cross-Modal Sentence Embeddings Bridging Massively Multilingual Text and Speech

Add code
Mar 17, 2026
Viaarxiv icon

On the (Generative) Linear Sketching Problem

Add code
Mar 15, 2026
Viaarxiv icon