Picture for Gregor Bachmann

Gregor Bachmann

The pitfalls of next-token prediction

Add code
Mar 11, 2024
Viaarxiv icon

A Language Model's Guide Through Latent Space

Add code
Feb 22, 2024
Viaarxiv icon

How Good is a Single Basin?

Add code
Feb 05, 2024
Viaarxiv icon

Disentangling Linear Mode-Connectivity

Add code
Dec 15, 2023
Viaarxiv icon

Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies

Add code
Nov 06, 2023
Viaarxiv icon

Scaling MLPs: A Tale of Inductive Bias

Add code
Jun 23, 2023
Viaarxiv icon

Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes

Add code
Jun 04, 2023
Viaarxiv icon

CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes

Add code
Apr 12, 2023
Viaarxiv icon

Random Teachers are Good Teachers

Add code
Feb 23, 2023
Viaarxiv icon

The Curious Case of Benign Memorization

Add code
Oct 25, 2022
Viaarxiv icon