Alert button
Picture for Thomas Hofmann

Thomas Hofmann

Alert button

Language Imbalance Can Boost Cross-lingual Generalisation

Add code
Bookmark button
Alert button
Apr 18, 2024
Anton Schäfer, Shauli Ravfogel, Thomas Hofmann, Tiago Pimentel, Imanol Schlag

Viaarxiv icon

On the Effect of (Near) Duplicate Subwords in Language Modelling

Add code
Bookmark button
Alert button
Apr 09, 2024
Anton Schäfer, Thomas Hofmann, Imanol Schlag, Tiago Pimentel

Viaarxiv icon

Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends

Add code
Bookmark button
Alert button
Mar 12, 2024
Sidak Pal Singh, Bobby He, Thomas Hofmann, Bernhard Schölkopf

Figure 1 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 2 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 3 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 4 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Viaarxiv icon

Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning

Add code
Bookmark button
Alert button
Feb 27, 2024
Lorenzo Noci, Alexandru Meterez, Thomas Hofmann, Antonio Orvieto

Viaarxiv icon

A Language Model's Guide Through Latent Space

Add code
Bookmark button
Alert button
Feb 22, 2024
Dimitri von Rütte, Sotiris Anagnostidis, Gregor Bachmann, Thomas Hofmann

Viaarxiv icon

Towards Meta-Pruning via Optimal Transport

Add code
Bookmark button
Alert button
Feb 13, 2024
Alexander Theus, Olin Geimer, Friedrich Wicke, Thomas Hofmann, Sotiris Anagnostidis, Sidak Pal Singh

Viaarxiv icon

How Good is a Single Basin?

Add code
Bookmark button
Alert button
Feb 05, 2024
Kai Lion, Lorenzo Noci, Thomas Hofmann, Gregor Bachmann

Viaarxiv icon

Probabilistic Abduction for Visual Abstract Reasoning via Learning Rules in Vector-symbolic Architectures

Add code
Bookmark button
Alert button
Jan 29, 2024
Michael Hersche, Francesco di Stefano, Thomas Hofmann, Abu Sebastian, Abbas Rahimi

Viaarxiv icon

Disentangling Linear Mode-Connectivity

Add code
Bookmark button
Alert button
Dec 15, 2023
Gul Sena Altintas, Gregor Bachmann, Lorenzo Noci, Thomas Hofmann

Viaarxiv icon