Alert button
Picture for Yonatan Belinkov

Yonatan Belinkov

Alert button

When Language Models Fall in Love: Animacy Processing in Transformer Language Models

Add code
Bookmark button
Alert button
Oct 23, 2023
Michael Hanna, Yonatan Belinkov, Sandro Pezzelle

Figure 1 for When Language Models Fall in Love: Animacy Processing in Transformer Language Models
Figure 2 for When Language Models Fall in Love: Animacy Processing in Transformer Language Models
Figure 3 for When Language Models Fall in Love: Animacy Processing in Transformer Language Models
Figure 4 for When Language Models Fall in Love: Animacy Processing in Transformer Language Models
Viaarxiv icon

Unified Concept Editing in Diffusion Models

Add code
Bookmark button
Alert button
Aug 25, 2023
Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzyńska, David Bau

Figure 1 for Unified Concept Editing in Diffusion Models
Figure 2 for Unified Concept Editing in Diffusion Models
Figure 3 for Unified Concept Editing in Diffusion Models
Figure 4 for Unified Concept Editing in Diffusion Models
Viaarxiv icon

Linearity of Relation Decoding in Transformer Language Models

Add code
Bookmark button
Alert button
Aug 17, 2023
Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau

Figure 1 for Linearity of Relation Decoding in Transformer Language Models
Figure 2 for Linearity of Relation Decoding in Transformer Language Models
Figure 3 for Linearity of Relation Decoding in Transformer Language Models
Figure 4 for Linearity of Relation Decoding in Transformer Language Models
Viaarxiv icon

Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias

Add code
Bookmark button
Alert button
Aug 01, 2023
Itay Itzhak, Gabriel Stanovsky, Nir Rosenfeld, Yonatan Belinkov

Figure 1 for Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias
Figure 2 for Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias
Figure 3 for Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias
Figure 4 for Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias
Viaarxiv icon

Generating Benchmarks for Factuality Evaluation of Language Models

Add code
Bookmark button
Alert button
Jul 13, 2023
Dor Muhlgay, Ori Ram, Inbal Magar, Yoav Levine, Nir Ratner, Yonatan Belinkov, Omri Abend, Kevin Leyton-Brown, Amnon Shashua, Yoav Shoham

Figure 1 for Generating Benchmarks for Factuality Evaluation of Language Models
Figure 2 for Generating Benchmarks for Factuality Evaluation of Language Models
Figure 3 for Generating Benchmarks for Factuality Evaluation of Language Models
Figure 4 for Generating Benchmarks for Factuality Evaluation of Language Models
Viaarxiv icon

ReFACT: Updating Text-to-Image Models by Editing the Text Encoder

Add code
Bookmark button
Alert button
Jun 01, 2023
Dana Arad, Hadas Orgad, Yonatan Belinkov

Figure 1 for ReFACT: Updating Text-to-Image Models by Editing the Text Encoder
Figure 2 for ReFACT: Updating Text-to-Image Models by Editing the Text Encoder
Figure 3 for ReFACT: Updating Text-to-Image Models by Editing the Text Encoder
Figure 4 for ReFACT: Updating Text-to-Image Models by Editing the Text Encoder
Viaarxiv icon

Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis

Add code
Bookmark button
Alert button
May 24, 2023
Alessandro Stolfo, Yonatan Belinkov, Mrinmaya Sachan

Figure 1 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 2 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 3 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 4 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Viaarxiv icon

Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT

Add code
Bookmark button
Alert button
May 22, 2023
Shahar Katz, Yonatan Belinkov

Figure 1 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Figure 2 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Figure 3 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Figure 4 for Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT
Viaarxiv icon

Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection

Add code
Bookmark button
Alert button
May 17, 2023
Shadi Iskander, Kira Radinsky, Yonatan Belinkov

Figure 1 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Figure 2 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Figure 3 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Figure 4 for Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection
Viaarxiv icon

ContraSim -- A Similarity Measure Based on Contrastive Learning

Add code
Bookmark button
Alert button
Mar 29, 2023
Adir Rahamim, Yonatan Belinkov

Figure 1 for ContraSim -- A Similarity Measure Based on Contrastive Learning
Figure 2 for ContraSim -- A Similarity Measure Based on Contrastive Learning
Figure 3 for ContraSim -- A Similarity Measure Based on Contrastive Learning
Figure 4 for ContraSim -- A Similarity Measure Based on Contrastive Learning
Viaarxiv icon