Michal Štefánik

Faculty of Informatics, Masaryk University

How Can We Synthesize High-Quality Pretraining Data? A Systematic Study of Prompt Design, Generator Model, and Source Data

Apr 15, 2026

Unravelling the Mechanisms of Manipulating Numbers in Language Models

Oct 30, 2025

Pre-trained Language Models Learn Remarkably Accurate Representations of Numbers

Jun 10, 2025

Negation: A Pink Elephant in the Large Language Models' Room?

Mar 28, 2025

Self-training Language Models for Arithmetic Reasoning

Jul 11, 2024

Concept-aware Data Construction Improves In-context Learning of Language Models

Mar 08, 2024

People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts

Jun 06, 2023

Calc-X: Enriching Arithmetical Chain-of-Thoughts Datasets by Interaction with Symbolic Systems

May 24, 2023

Concept-aware Training Improves In-context Learning Ability of Language Models

May 23, 2023

Think Twice: Measuring the Efficiency of Eliminating Prediction Shortcuts of Question Answering Models

May 11, 2023