Picture for Yonatan Belinkov

Yonatan Belinkov

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms

Add code
Mar 26, 2024
Viaarxiv icon

Concept-Best-Matching: Evaluating Compositionality in Emergent Communication

Add code
Mar 17, 2024
Viaarxiv icon

Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information

Add code
Mar 14, 2024
Figure 1 for Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
Figure 2 for Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
Figure 3 for Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
Figure 4 for Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
Viaarxiv icon

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines

Add code
Mar 09, 2024
Figure 1 for Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Figure 2 for Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Figure 3 for Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Figure 4 for Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Viaarxiv icon

A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry

Add code
Feb 27, 2024
Figure 1 for A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry
Figure 2 for A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry
Figure 3 for A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry
Figure 4 for A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry
Viaarxiv icon

Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking

Add code
Feb 22, 2024
Figure 1 for Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
Figure 2 for Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
Figure 3 for Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
Figure 4 for Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
Viaarxiv icon

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space

Add code
Feb 20, 2024
Figure 1 for Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
Figure 2 for Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
Figure 3 for Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
Figure 4 for Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
Viaarxiv icon

Accelerating the Global Aggregation of Local Explanations

Add code
Dec 23, 2023
Figure 1 for Accelerating the Global Aggregation of Local Explanations
Figure 2 for Accelerating the Global Aggregation of Local Explanations
Figure 3 for Accelerating the Global Aggregation of Local Explanations
Figure 4 for Accelerating the Global Aggregation of Local Explanations
Viaarxiv icon

When Language Models Fall in Love: Animacy Processing in Transformer Language Models

Add code
Oct 23, 2023
Viaarxiv icon

Unified Concept Editing in Diffusion Models

Add code
Aug 25, 2023
Figure 1 for Unified Concept Editing in Diffusion Models
Figure 2 for Unified Concept Editing in Diffusion Models
Figure 3 for Unified Concept Editing in Diffusion Models
Figure 4 for Unified Concept Editing in Diffusion Models
Viaarxiv icon