Picture for Naomi Saphra

Naomi Saphra

ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context

Add code
Jul 10, 2024
Viaarxiv icon

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

Add code
Jun 25, 2024
Figure 1 for Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
Figure 2 for Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
Figure 3 for Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
Figure 4 for Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
Viaarxiv icon

Transcendence: Generative Models Can Outperform The Experts That Train Them

Add code
Jun 17, 2024
Figure 1 for Transcendence: Generative Models Can Outperform The Experts That Train Them
Figure 2 for Transcendence: Generative Models Can Outperform The Experts That Train Them
Figure 3 for Transcendence: Generative Models Can Outperform The Experts That Train Them
Figure 4 for Transcendence: Generative Models Can Outperform The Experts That Train Them
Viaarxiv icon

Knowing Your Nonlinearities: Shapley Interactions Reveal the Underlying Structure of Data

Add code
Mar 19, 2024
Figure 1 for Knowing Your Nonlinearities: Shapley Interactions Reveal the Underlying Structure of Data
Figure 2 for Knowing Your Nonlinearities: Shapley Interactions Reveal the Underlying Structure of Data
Figure 3 for Knowing Your Nonlinearities: Shapley Interactions Reveal the Underlying Structure of Data
Figure 4 for Knowing Your Nonlinearities: Shapley Interactions Reveal the Underlying Structure of Data
Viaarxiv icon

Towards out-of-distribution generalization in large-scale astronomical surveys: robust networks learn similar representations

Add code
Nov 29, 2023
Figure 1 for Towards out-of-distribution generalization in large-scale astronomical surveys: robust networks learn similar representations
Figure 2 for Towards out-of-distribution generalization in large-scale astronomical surveys: robust networks learn similar representations
Viaarxiv icon

Attribute Diversity Determines the Systematicity Gap in VQA

Add code
Nov 15, 2023
Figure 1 for Attribute Diversity Determines the Systematicity Gap in VQA
Figure 2 for Attribute Diversity Determines the Systematicity Gap in VQA
Figure 3 for Attribute Diversity Determines the Systematicity Gap in VQA
Figure 4 for Attribute Diversity Determines the Systematicity Gap in VQA
Viaarxiv icon

First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models

Add code
Nov 08, 2023
Viaarxiv icon

TRAM: Bridging Trust Regions and Sharpness Aware Minimization

Add code
Oct 05, 2023
Figure 1 for TRAM: Bridging Trust Regions and Sharpness Aware Minimization
Figure 2 for TRAM: Bridging Trust Regions and Sharpness Aware Minimization
Figure 3 for TRAM: Bridging Trust Regions and Sharpness Aware Minimization
Figure 4 for TRAM: Bridging Trust Regions and Sharpness Aware Minimization
Viaarxiv icon

Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs

Add code
Sep 28, 2023
Viaarxiv icon

Latent State Models of Training Dynamics

Add code
Aug 18, 2023
Figure 1 for Latent State Models of Training Dynamics
Figure 2 for Latent State Models of Training Dynamics
Figure 3 for Latent State Models of Training Dynamics
Figure 4 for Latent State Models of Training Dynamics
Viaarxiv icon