Picture for Max Tegmark

Max Tegmark

MIT

On the creation of narrow AI: hierarchy and nonlocality of neural network skills

Add code
May 21, 2025
Viaarxiv icon

Neural Thermodynamic Laws for Large Language Model Training

Add code
May 15, 2025
Viaarxiv icon

Scaling Laws For Scalable Oversight

Add code
Apr 25, 2025
Viaarxiv icon

Do Two AI Scientists Agree?

Add code
Apr 03, 2025
Viaarxiv icon

Towards Understanding Distilled Reasoning Models: A Representational Approach

Add code
Mar 05, 2025
Viaarxiv icon

Are Sparse Autoencoders Useful? A Case Study in Sparse Probing

Add code
Feb 23, 2025
Viaarxiv icon

Harmonic Loss Trains Interpretable AI Models

Add code
Feb 03, 2025
Viaarxiv icon

Language Models Use Trigonometry to Do Addition

Add code
Feb 02, 2025
Viaarxiv icon

Low-Rank Adapting Models for Sparse Autoencoders

Add code
Jan 31, 2025
Figure 1 for Low-Rank Adapting Models for Sparse Autoencoders
Figure 2 for Low-Rank Adapting Models for Sparse Autoencoders
Figure 3 for Low-Rank Adapting Models for Sparse Autoencoders
Figure 4 for Low-Rank Adapting Models for Sparse Autoencoders
Viaarxiv icon

Open Problems in Mechanistic Interpretability

Add code
Jan 27, 2025
Figure 1 for Open Problems in Mechanistic Interpretability
Figure 2 for Open Problems in Mechanistic Interpretability
Figure 3 for Open Problems in Mechanistic Interpretability
Figure 4 for Open Problems in Mechanistic Interpretability
Viaarxiv icon