Tiago Pimentel

ETH Zurich

What Language is This? Ask Your Tokenizer

Feb 19, 2026

Operationalising the Superficial Alignment Hypothesis via Task Complexity

Feb 17, 2026

What Do Prosody and Text Convey? Characterizing How Meaningful Information is Distributed Across Multiple Channels

Dec 18, 2025

Do Generalisation Results Generalise?

Dec 08, 2025

Tokenisation over Bounded Alphabets is Hard

Nov 19, 2025

Convergence and Divergence of Language Models under Different Random Seeds

Sep 30, 2025

Using Information Theory to Characterize Prosodic Typology: The Case of Tone, Pitch-Accent and Stress-Accent

May 12, 2025

The time scale of redundancy between prosody and linguistic context

Mar 14, 2025

Tokenisation is NP-Complete

Dec 19, 2024

Towards a Similarity-adjusted Surprisal Theory

Oct 23, 2024