Jacob Andreas

Adaptive Language-Guided Abstraction from Contrastive Explanations

Sep 12, 2024
Via arXiv

Unforgettable Generalization in Language Models

Sep 03, 2024
Via arXiv

Language Modeling with Editable External Knowledge

Jun 17, 2024
Via arXiv

A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation

Jun 11, 2024
Via arXiv

Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models

May 15, 2024
Via arXiv

Learning Phonotactics from Linguistic Informants

May 08, 2024
Via arXiv

Policy Learning with a Language Bottleneck

May 07, 2024
Via arXiv

Toward In-Context Teaching: Adapting Examples to Students' Misconceptions

May 07, 2024
Via arXiv

Automatic Discovery of Visual Circuits

Apr 22, 2024
Via arXiv

A Multimodal Automated Interpretability Agent

Apr 22, 2024
Via arXiv