Picture for Jacob Andreas

Jacob Andreas

Evaluating the Utility of Model Explanations for Model Development

Add code
Dec 10, 2023
Figure 1 for Evaluating the Utility of Model Explanations for Model Development
Figure 2 for Evaluating the Utility of Model Explanations for Model Development
Figure 3 for Evaluating the Utility of Model Explanations for Model Development
Figure 4 for Evaluating the Utility of Model Explanations for Model Development
Viaarxiv icon

Modeling Boundedly Rational Agents with Latent Inference Budgets

Add code
Dec 07, 2023
Viaarxiv icon

Cognitive Dissonance: Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness?

Add code
Nov 27, 2023
Viaarxiv icon

Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning

Add code
Nov 16, 2023
Figure 1 for Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning
Figure 2 for Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning
Figure 3 for Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning
Figure 4 for Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning
Viaarxiv icon

Interpreting User Requests in the Context of Natural Language Standing Instructions

Add code
Nov 16, 2023
Viaarxiv icon

Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling

Add code
Nov 15, 2023
Figure 1 for Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling
Figure 2 for Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling
Figure 3 for Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling
Figure 4 for Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling
Viaarxiv icon

LILO: Learning Interpretable Libraries by Compressing and Documenting Code

Add code
Oct 30, 2023
Figure 1 for LILO: Learning Interpretable Libraries by Compressing and Documenting Code
Figure 2 for LILO: Learning Interpretable Libraries by Compressing and Documenting Code
Figure 3 for LILO: Learning Interpretable Libraries by Compressing and Documenting Code
Figure 4 for LILO: Learning Interpretable Libraries by Compressing and Documenting Code
Viaarxiv icon

Pushdown Layers: Encoding Recursive Structure in Transformer Language Models

Add code
Oct 29, 2023
Figure 1 for Pushdown Layers: Encoding Recursive Structure in Transformer Language Models
Figure 2 for Pushdown Layers: Encoding Recursive Structure in Transformer Language Models
Figure 3 for Pushdown Layers: Encoding Recursive Structure in Transformer Language Models
Figure 4 for Pushdown Layers: Encoding Recursive Structure in Transformer Language Models
Viaarxiv icon

Visual Grounding Helps Learn Word Meanings in Low-Data Regimes

Add code
Oct 20, 2023
Viaarxiv icon

Eliciting Human Preferences with Language Models

Add code
Oct 17, 2023
Viaarxiv icon