Picture for Yonatan Belinkov

Yonatan Belinkov

Linearity of Relation Decoding in Transformer Language Models

Add code
Aug 17, 2023
Viaarxiv icon

Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias

Add code
Aug 01, 2023
Viaarxiv icon

Generating Benchmarks for Factuality Evaluation of Language Models

Add code
Jul 13, 2023
Figure 1 for Generating Benchmarks for Factuality Evaluation of Language Models
Figure 2 for Generating Benchmarks for Factuality Evaluation of Language Models
Figure 3 for Generating Benchmarks for Factuality Evaluation of Language Models
Figure 4 for Generating Benchmarks for Factuality Evaluation of Language Models
Viaarxiv icon

ReFACT: Updating Text-to-Image Models by Editing the Text Encoder

Add code
Jun 01, 2023
Viaarxiv icon

Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis

Add code
May 24, 2023
Figure 1 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 2 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 3 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 4 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Viaarxiv icon

Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT

Add code
May 22, 2023
Figure 1 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Figure 2 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Figure 3 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Figure 4 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Viaarxiv icon

Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection

Add code
May 17, 2023
Figure 1 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Figure 2 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Figure 3 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Figure 4 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Viaarxiv icon

ContraSim -- A Similarity Measure Based on Contrastive Learning

Add code
Mar 29, 2023
Viaarxiv icon

Editing Implicit Assumptions in Text-to-Image Diffusion Models

Add code
Mar 14, 2023
Viaarxiv icon

Parallel Context Windows Improve In-Context Learning of Large Language Models

Add code
Dec 21, 2022
Figure 1 for Parallel Context Windows Improve In-Context Learning of Large Language Models
Figure 2 for Parallel Context Windows Improve In-Context Learning of Large Language Models
Figure 3 for Parallel Context Windows Improve In-Context Learning of Large Language Models
Figure 4 for Parallel Context Windows Improve In-Context Learning of Large Language Models
Viaarxiv icon