Alert button
Picture for Yonatan Belinkov

Yonatan Belinkov

Alert button

Accelerating the Global Aggregation of Local Explanations

Dec 23, 2023
Alon Mor, Yonatan Belinkov, Benny Kimelfeld

Viaarxiv icon

When Language Models Fall in Love: Animacy Processing in Transformer Language Models

Oct 23, 2023
Michael Hanna, Yonatan Belinkov, Sandro Pezzelle

Figure 1 for When Language Models Fall in Love: Animacy Processing in Transformer Language Models
Figure 2 for When Language Models Fall in Love: Animacy Processing in Transformer Language Models
Figure 3 for When Language Models Fall in Love: Animacy Processing in Transformer Language Models
Figure 4 for When Language Models Fall in Love: Animacy Processing in Transformer Language Models
Viaarxiv icon

Unified Concept Editing in Diffusion Models

Aug 25, 2023
Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzyńska, David Bau

Figure 1 for Unified Concept Editing in Diffusion Models
Figure 2 for Unified Concept Editing in Diffusion Models
Figure 3 for Unified Concept Editing in Diffusion Models
Figure 4 for Unified Concept Editing in Diffusion Models
Viaarxiv icon

Linearity of Relation Decoding in Transformer Language Models

Aug 17, 2023
Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau

Figure 1 for Linearity of Relation Decoding in Transformer Language Models
Figure 2 for Linearity of Relation Decoding in Transformer Language Models
Figure 3 for Linearity of Relation Decoding in Transformer Language Models
Figure 4 for Linearity of Relation Decoding in Transformer Language Models
Viaarxiv icon

Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias

Aug 01, 2023
Itay Itzhak, Gabriel Stanovsky, Nir Rosenfeld, Yonatan Belinkov

Figure 1 for Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias
Figure 2 for Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias
Figure 3 for Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias
Figure 4 for Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias
Viaarxiv icon

Generating Benchmarks for Factuality Evaluation of Language Models

Jul 13, 2023
Dor Muhlgay, Ori Ram, Inbal Magar, Yoav Levine, Nir Ratner, Yonatan Belinkov, Omri Abend, Kevin Leyton-Brown, Amnon Shashua, Yoav Shoham

Figure 1 for Generating Benchmarks for Factuality Evaluation of Language Models
Figure 2 for Generating Benchmarks for Factuality Evaluation of Language Models
Figure 3 for Generating Benchmarks for Factuality Evaluation of Language Models
Figure 4 for Generating Benchmarks for Factuality Evaluation of Language Models
Viaarxiv icon

ReFACT: Updating Text-to-Image Models by Editing the Text Encoder

Jun 01, 2023
Dana Arad, Hadas Orgad, Yonatan Belinkov

Figure 1 for ReFACT: Updating Text-to-Image Models by Editing the Text Encoder
Figure 2 for ReFACT: Updating Text-to-Image Models by Editing the Text Encoder
Figure 3 for ReFACT: Updating Text-to-Image Models by Editing the Text Encoder
Figure 4 for ReFACT: Updating Text-to-Image Models by Editing the Text Encoder
Viaarxiv icon

Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis

May 24, 2023
Alessandro Stolfo, Yonatan Belinkov, Mrinmaya Sachan

Figure 1 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 2 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 3 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 4 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Viaarxiv icon

Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT

May 22, 2023
Shahar Katz, Yonatan Belinkov

Figure 1 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Figure 2 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Figure 3 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Figure 4 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Viaarxiv icon

Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection

May 17, 2023
Shadi Iskander, Kira Radinsky, Yonatan Belinkov

Figure 1 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Figure 2 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Figure 3 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Figure 4 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Viaarxiv icon