Noah D. Goodman

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Mar 18, 2024

pyvene: A Library for Understanding and Improving PyTorch Models via Interventions

Mar 12, 2024

Evaluating and Optimizing Educational Content with Large Language Model Judgments

Mar 05, 2024

Automated Statistical Model Discovery with Language Models

Feb 27, 2024

A Reply to Makelov et al.'s "Interpretability Illusion" Arguments

Jan 23, 2024

Codebook Features: Sparse and Discrete Interpretability for Neural Networks

Oct 26, 2023

Social Contract AI: Aligning AI Assistants with Implicit Group Norms

Oct 26, 2023

CLEVRER-Humans: Describing Physical and Causal Events the Human Way

Oct 05, 2023

Hypothesis Search: Inductive Reasoning with Language Models

Sep 11, 2023

From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought

Jun 23, 2023