Picture for Pontus Stenetorp

Pontus Stenetorp

The Role of Mixed-Language Documents for Multilingual Large Language Model Pretraining

Add code
Jan 01, 2026
Viaarxiv icon

Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation

Add code
Oct 02, 2025
Figure 1 for Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation
Figure 2 for Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation
Figure 3 for Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation
Figure 4 for Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation
Viaarxiv icon

Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification

Add code
Sep 17, 2025
Figure 1 for Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification
Figure 2 for Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification
Figure 3 for Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification
Figure 4 for Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification
Viaarxiv icon

AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench

Add code
Jul 03, 2025
Viaarxiv icon

BIS Reasoning 1.0: The First Large-Scale Japanese Benchmark for Belief-Inconsistent Syllogistic Reasoning

Add code
Jun 08, 2025
Viaarxiv icon

SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages?

Add code
Jun 05, 2025
Figure 1 for SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages?
Figure 2 for SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages?
Figure 3 for SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages?
Figure 4 for SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages?
Viaarxiv icon

Multilingual Language Model Pretraining using Machine-translated Data

Add code
Feb 18, 2025
Viaarxiv icon

Warmup Generations: A Task-Agnostic Approach for Guiding Sequence-to-Sequence Learning with Unsupervised Initial State Generation

Add code
Feb 17, 2025
Figure 1 for Warmup Generations: A Task-Agnostic Approach for Guiding Sequence-to-Sequence Learning with Unsupervised Initial State Generation
Figure 2 for Warmup Generations: A Task-Agnostic Approach for Guiding Sequence-to-Sequence Learning with Unsupervised Initial State Generation
Figure 3 for Warmup Generations: A Task-Agnostic Approach for Guiding Sequence-to-Sequence Learning with Unsupervised Initial State Generation
Figure 4 for Warmup Generations: A Task-Agnostic Approach for Guiding Sequence-to-Sequence Learning with Unsupervised Initial State Generation
Viaarxiv icon

Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models

Add code
Nov 21, 2024
Figure 1 for Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models
Figure 2 for Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models
Figure 3 for Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models
Figure 4 for Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models
Viaarxiv icon

Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language

Add code
Oct 31, 2024
Figure 1 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Figure 2 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Figure 3 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Figure 4 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Viaarxiv icon