Picture for Niklas Nolte

Niklas Nolte

Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking

Add code
May 30, 2025
Viaarxiv icon

Learning Distributions over Permutations and Rankings with Factorized Representations

Add code
May 30, 2025
Viaarxiv icon

Transformers Can Navigate Mazes With Multi-Step Prediction

Add code
Dec 06, 2024
Figure 1 for Transformers Can Navigate Mazes With Multi-Step Prediction
Figure 2 for Transformers Can Navigate Mazes With Multi-Step Prediction
Figure 3 for Transformers Can Navigate Mazes With Multi-Step Prediction
Figure 4 for Transformers Can Navigate Mazes With Multi-Step Prediction
Viaarxiv icon

MagicPIG: LSH Sampling for Efficient LLM Generation

Add code
Oct 21, 2024
Figure 1 for MagicPIG: LSH Sampling for Efficient LLM Generation
Figure 2 for MagicPIG: LSH Sampling for Efficient LLM Generation
Figure 3 for MagicPIG: LSH Sampling for Efficient LLM Generation
Figure 4 for MagicPIG: LSH Sampling for Efficient LLM Generation
Viaarxiv icon

The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More

Add code
Jun 07, 2024
Figure 1 for The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More
Figure 2 for The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More
Figure 3 for The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More
Figure 4 for The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More
Viaarxiv icon

From Neurons to Neutrons: A Case Study in Interpretability

Add code
May 27, 2024
Viaarxiv icon

Memory Mosaics

Add code
May 10, 2024
Figure 1 for Memory Mosaics
Figure 2 for Memory Mosaics
Figure 3 for Memory Mosaics
Figure 4 for Memory Mosaics
Viaarxiv icon

Transforming the Bootstrap: Using Transformers to Compute Scattering Amplitudes in Planar N = 4 Super Yang-Mills Theory

Add code
May 09, 2024
Figure 1 for Transforming the Bootstrap: Using Transformers to Compute Scattering Amplitudes in Planar N = 4 Super Yang-Mills Theory
Figure 2 for Transforming the Bootstrap: Using Transformers to Compute Scattering Amplitudes in Planar N = 4 Super Yang-Mills Theory
Figure 3 for Transforming the Bootstrap: Using Transformers to Compute Scattering Amplitudes in Planar N = 4 Super Yang-Mills Theory
Figure 4 for Transforming the Bootstrap: Using Transformers to Compute Scattering Amplitudes in Planar N = 4 Super Yang-Mills Theory
Viaarxiv icon

Salsa Fresca: Angular Embeddings and Pre-Training for ML Attacks on Learning With Errors

Add code
Feb 02, 2024
Viaarxiv icon

KBFormer: A Diffusion Model for Structured Entity Completion

Add code
Dec 08, 2023
Viaarxiv icon