Naomi Saphra

Towards out-of-distribution generalization in large-scale astronomical surveys: robust networks learn similar representations

Nov 29, 2023
Yash Gondhalekar, Sultan Hassan, Naomi Saphra, Sambatra Andrianomena

Via arXiv

Attribute Diversity Determines the Systematicity Gap in VQA

Nov 15, 2023
Ian Berlot-Attwell, A. Michael Carrell, Kumar Krishna Agrawal, Yash Sharma, Naomi Saphra

Via arXiv

First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models

Nov 08, 2023
Naomi Saphra, Eve Fleisig, Kyunghyun Cho, Adam Lopez

Via arXiv

TRAM: Bridging Trust Regions and Sharpness Aware Minimization

Oct 05, 2023
Tom Sherborne, Naomi Saphra, Pradeep Dasigi, Hao Peng

Via arXiv

Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs

Sep 28, 2023
Angelica Chen, Ravid Shwartz-Ziv, Kyunghyun Cho, Matthew L. Leavitt, Naomi Saphra

Via arXiv

Latent State Models of Training Dynamics

Aug 18, 2023
Michael Y. Hu, Angelica Chen, Naomi Saphra, Kyunghyun Cho

Via arXiv

Dynamic Masking Rate Schedules for MLM Pretraining

May 24, 2023
Zachary Ankner, Naomi Saphra, Davis Blalock, Jonathan Frankle, Matthew L. Leavitt

Via arXiv

One Venue, Two Conferences: The Separation of Chinese and American Citation Networks

Nov 17, 2022
Bingchen Zhao, Yuling Gu, Jessica Zosa Forde, Naomi Saphra

Via arXiv

State-of-the-art generalisation research in NLP: a taxonomy and review

Oct 10, 2022
Dieuwke Hupkes, Mario Giulianelli, Verna Dankers, Mikel Artetxe, Yanai Elazar, Tiago Pimentel, Christos Christodoulopoulos, Karim Lasri, Naomi Saphra, Arabella Sinclair, Dennis Ulmer, Florian Schottmann, Khuyagbaatar Batsuren, Kaiser Sun, Koustuv Sinha, Leila Khalatbari, Maria Ryskina, Rita Frieske, Ryan Cotterell, Zhijing Jin

Via arXiv