Pontus Stenetorp

AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench

Jul 03, 2025
Via arXiv

BIS Reasoning 1.0: The First Large-Scale Japanese Benchmark for Belief-Inconsistent Syllogistic Reasoning

Jun 08, 2025
Via arXiv

SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages?

Jun 05, 2025
Via arXiv

Multilingual Language Model Pretraining using Machine-translated Data

Feb 18, 2025
Via arXiv

Warmup Generations: A Task-Agnostic Approach for Guiding Sequence-to-Sequence Learning with Unsupervised Initial State Generation

Feb 17, 2025
Via arXiv

Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models

Nov 21, 2024
Via arXiv

Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language

Oct 31, 2024
Via arXiv

Jet Expansions of Residual Computation

Oct 08, 2024
Via arXiv

Linguini: A benchmark for language-agnostic linguistic reasoning

Sep 18, 2024
Via arXiv

Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models

Jul 25, 2024
Via arXiv