Alert button
Picture for Max Tegmark

Max Tegmark

Alert button

GenEFT: Understanding Statics and Dynamics of Model Generalization via Effective Theory

Feb 08, 2024
David D. Baek, Ziming Liu, Max Tegmark

Viaarxiv icon

Opening the AI black box: program synthesis via mechanistic interpretability

Feb 07, 2024
Eric J. Michaud, Isaac Liao, Vedang Lad, Ziming Liu, Anish Mudide, Chloe Loughridge, Zifan Carl Guo, Tara Rezaei Kheirkhah, Mateja Vukelić, Max Tegmark

Viaarxiv icon

A Resource Model For Neural Scaling Law

Feb 07, 2024
Jinyeop Song, Ziming Liu, Max Tegmark, Jeff Gore

Viaarxiv icon

Black-Box Access is Insufficient for Rigorous AI Audits

Jan 25, 2024
Stephen Casper, Carson Ezell, Charlotte Siegmann, Noam Kolt, Taylor Lynn Curtis, Benjamin Bucknall, Andreas Haupt, Kevin Wei, Jérémy Scheurer, Marius Hobbhahn, Lee Sharkey, Satyapriya Krishna, Marvin Von Hagen, Silas Alberti, Alan Chan, Qinyi Sun, Michael Gerovitch, David Bau, Max Tegmark, David Krueger, Dylan Hadfield-Menell

Viaarxiv icon

Generating Interpretable Networks using Hypernetworks

Dec 05, 2023
Isaac Liao, Ziming Liu, Max Tegmark

Figure 1 for Generating Interpretable Networks using Hypernetworks
Figure 2 for Generating Interpretable Networks using Hypernetworks
Figure 3 for Generating Interpretable Networks using Hypernetworks
Figure 4 for Generating Interpretable Networks using Hypernetworks
Viaarxiv icon

Growing Brains: Co-emergence of Anatomical and Functional Modularity in Recurrent Neural Networks

Oct 11, 2023
Ziming Liu, Mikail Khona, Ila R. Fiete, Max Tegmark

Viaarxiv icon

The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets

Oct 10, 2023
Samuel Marks, Max Tegmark

Viaarxiv icon

Divide-and-Conquer Dynamics in AI-Driven Disempowerment

Oct 09, 2023
Peter S. Park, Max Tegmark

Viaarxiv icon

Grokking as Compression: A Nonlinear Complexity Perspective

Oct 09, 2023
Ziming Liu, Ziqian Zhong, Max Tegmark

Viaarxiv icon

A Neural Scaling Law from Lottery Ticket Ensembling

Oct 03, 2023
Ziming Liu, Max Tegmark

Viaarxiv icon