Picture for Thomas Hofmann

Thomas Hofmann

ETH Zurich

Landscaping Linear Mode Connectivity

Add code
Jun 24, 2024
Viaarxiv icon

Explicit Word Density Estimation for Language Modelling

Add code
Jun 10, 2024
Viaarxiv icon

Causal Estimation of Memorisation Profiles

Add code
Jun 06, 2024
Viaarxiv icon

Understanding and Minimising Outlier Features in Neural Network Training

Add code
May 29, 2024
Figure 1 for Understanding and Minimising Outlier Features in Neural Network Training
Figure 2 for Understanding and Minimising Outlier Features in Neural Network Training
Figure 3 for Understanding and Minimising Outlier Features in Neural Network Training
Figure 4 for Understanding and Minimising Outlier Features in Neural Network Training
Viaarxiv icon

Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control

Add code
Apr 21, 2024
Figure 1 for Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control
Figure 2 for Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control
Figure 3 for Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control
Figure 4 for Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control
Viaarxiv icon

Language Imbalance Can Boost Cross-lingual Generalisation

Add code
Apr 11, 2024
Viaarxiv icon

On the Effect of Duplicate Subwords in Language Modelling

Add code
Apr 09, 2024
Figure 1 for On the Effect of  Duplicate Subwords in Language Modelling
Figure 2 for On the Effect of  Duplicate Subwords in Language Modelling
Figure 3 for On the Effect of  Duplicate Subwords in Language Modelling
Figure 4 for On the Effect of  Duplicate Subwords in Language Modelling
Viaarxiv icon

Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends

Add code
Mar 12, 2024
Figure 1 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 2 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 3 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 4 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Viaarxiv icon

Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning

Add code
Feb 27, 2024
Figure 1 for Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning
Figure 2 for Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning
Figure 3 for Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning
Figure 4 for Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning
Viaarxiv icon

A Language Model's Guide Through Latent Space

Add code
Feb 22, 2024
Figure 1 for A Language Model's Guide Through Latent Space
Figure 2 for A Language Model's Guide Through Latent Space
Figure 3 for A Language Model's Guide Through Latent Space
Figure 4 for A Language Model's Guide Through Latent Space
Viaarxiv icon