Jannis Vamvas

Translation Asymmetry in LLMs as a Data Augmentation Factor: A Case Study for 6 Romansh Language Varieties

Mar 26, 2026
Via arXiv

Robust Language Identification for Romansh Varieties

Mar 16, 2026
Via arXiv

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning

Mar 11, 2026
Via arXiv

SwissGov-RSD: A Human-annotated, Cross-lingual Benchmark for Token-level Recognition of Semantic Differences Between Related Documents

Dec 08, 2025
Via arXiv

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Sep 17, 2025
Via arXiv

20min-XD: A Comparable Corpus of Swiss News Articles

Apr 30, 2025
Via arXiv

Source-primed Multi-turn Conversation Helps Large Language Models Translate Documents

Mar 13, 2025
Via arXiv

Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents

May 13, 2024
Via arXiv

Linear-time Minimum Bayes Risk Decoding with Reference Aggregation

Feb 06, 2024
Via arXiv

Modular Adaptation of Multilingual Encoders to Written Swiss German Dialect

Jan 25, 2024
Via arXiv