Picture for Andreea Tomescu

Andreea Tomescu

TF3-RO-50M: Training Compact Romanian Language Models from Scratch on Synthetic Moral Microfiction

Add code
Jan 15, 2026
Viaarxiv icon

Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost

Add code
Sep 09, 2025
Figure 1 for Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost
Figure 2 for Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost
Figure 3 for Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost
Figure 4 for Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost
Viaarxiv icon

TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models

Add code
Apr 29, 2025
Viaarxiv icon

Synthetic Data Generation Using Large Language Models: Advances in Text and Code

Add code
Mar 18, 2025
Viaarxiv icon