Picture for Imanol Schlag

Imanol Schlag

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Add code
Sep 17, 2025
Viaarxiv icon

Towards Fully FP8 GEMM LLM Training at Scale

Add code
May 26, 2025
Viaarxiv icon

Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks

Add code
May 19, 2025
Viaarxiv icon

Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs

Add code
Apr 08, 2025
Figure 1 for Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs
Figure 2 for Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs
Figure 3 for Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs
Figure 4 for Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs
Viaarxiv icon

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Add code
Nov 29, 2024
Figure 1 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 2 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 3 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 4 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Viaarxiv icon

Understanding and Minimising Outlier Features in Neural Network Training

Add code
May 29, 2024
Viaarxiv icon

Language Imbalance Can Boost Cross-lingual Generalisation

Add code
Apr 11, 2024
Viaarxiv icon

On the Effect of Duplicate Subwords in Language Modelling

Add code
Apr 09, 2024
Viaarxiv icon

The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute

Add code
Sep 20, 2023
Figure 1 for The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Figure 2 for The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Figure 3 for The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Figure 4 for The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Viaarxiv icon

Mindstorms in Natural Language-Based Societies of Mind

Add code
May 26, 2023
Figure 1 for Mindstorms in Natural Language-Based Societies of Mind
Figure 2 for Mindstorms in Natural Language-Based Societies of Mind
Figure 3 for Mindstorms in Natural Language-Based Societies of Mind
Figure 4 for Mindstorms in Natural Language-Based Societies of Mind
Viaarxiv icon