Picture for Jack Merullo

Jack Merullo

Transferring Features Across Language Models With Model Stitching

Add code
Jun 07, 2025
Viaarxiv icon

On Linear Representations and Pretraining Data Frequency in Language Models

Add code
Apr 16, 2025
Viaarxiv icon

$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources

Add code
Oct 30, 2024
Viaarxiv icon

Talking Heads: Understanding Inter-layer Communication in Transformer Language Models

Add code
Jun 13, 2024
Viaarxiv icon

Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting

Add code
May 28, 2024
Figure 1 for Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
Figure 2 for Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
Figure 3 for Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
Figure 4 for Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
Viaarxiv icon

Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models

Add code
May 03, 2024
Figure 1 for Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models
Figure 2 for Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models
Figure 3 for Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models
Figure 4 for Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models
Viaarxiv icon

Transformer Mechanisms Mimic Frontostriatal Gating Operations When Trained on Human Working Memory Tasks

Add code
Feb 13, 2024
Viaarxiv icon

Characterizing Mechanisms for Factual Recall in Language Models

Add code
Oct 24, 2023
Viaarxiv icon

Circuit Component Reuse Across Tasks in Transformer Language Models

Add code
Oct 12, 2023
Viaarxiv icon

Language Models Implement Simple Word2Vec-style Vector Arithmetic

Add code
May 25, 2023
Figure 1 for Language Models Implement Simple Word2Vec-style Vector Arithmetic
Figure 2 for Language Models Implement Simple Word2Vec-style Vector Arithmetic
Figure 3 for Language Models Implement Simple Word2Vec-style Vector Arithmetic
Figure 4 for Language Models Implement Simple Word2Vec-style Vector Arithmetic
Viaarxiv icon