Picture for Katja Filippova

Katja Filippova

Google Research

Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice

Add code
Dec 09, 2024
Figure 1 for Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice
Figure 2 for Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice
Figure 3 for Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice
Figure 4 for Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice
Viaarxiv icon

Theoretical and Practical Perspectives on what Influence Functions Do

Add code
May 26, 2023
Figure 1 for Theoretical and Practical Perspectives on what Influence Functions Do
Figure 2 for Theoretical and Practical Perspectives on what Influence Functions Do
Figure 3 for Theoretical and Practical Perspectives on what Influence Functions Do
Figure 4 for Theoretical and Practical Perspectives on what Influence Functions Do
Viaarxiv icon

Dissecting Recall of Factual Associations in Auto-Regressive Language Models

Add code
Apr 28, 2023
Figure 1 for Dissecting Recall of Factual Associations in Auto-Regressive Language Models
Figure 2 for Dissecting Recall of Factual Associations in Auto-Regressive Language Models
Figure 3 for Dissecting Recall of Factual Associations in Auto-Regressive Language Models
Figure 4 for Dissecting Recall of Factual Associations in Auto-Regressive Language Models
Viaarxiv icon

Make Every Example Count: On Stability and Utility of Self-Influence for Learning from Noisy NLP Datasets

Add code
Feb 27, 2023
Viaarxiv icon

Understanding Text Classification Data and Models Using Aggregated Input Salience

Add code
Nov 11, 2022
Figure 1 for Understanding Text Classification Data and Models Using Aggregated Input Salience
Figure 2 for Understanding Text Classification Data and Models Using Aggregated Input Salience
Figure 3 for Understanding Text Classification Data and Models Using Aggregated Input Salience
Figure 4 for Understanding Text Classification Data and Models Using Aggregated Input Salience
Viaarxiv icon

Diagnosing AI Explanation Methods with Folk Concepts of Behavior

Add code
Jan 27, 2022
Figure 1 for Diagnosing AI Explanation Methods with Folk Concepts of Behavior
Figure 2 for Diagnosing AI Explanation Methods with Folk Concepts of Behavior
Figure 3 for Diagnosing AI Explanation Methods with Folk Concepts of Behavior
Figure 4 for Diagnosing AI Explanation Methods with Folk Concepts of Behavior
Viaarxiv icon

"Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification

Add code
Nov 14, 2021
Figure 1 for "Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification
Figure 2 for "Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification
Figure 3 for "Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification
Figure 4 for "Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification
Viaarxiv icon

Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data

Add code
Oct 12, 2020
Figure 1 for Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data
Figure 2 for Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data
Figure 3 for Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data
Figure 4 for Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data
Viaarxiv icon

The elephant in the interpretability room: Why use attention as explanation when we have saliency methods?

Add code
Oct 12, 2020
Figure 1 for The elephant in the interpretability room: Why use attention as explanation when we have saliency methods?
Viaarxiv icon

We Need to Talk About Random Splits

Add code
May 01, 2020
Figure 1 for We Need to Talk About Random Splits
Figure 2 for We Need to Talk About Random Splits
Figure 3 for We Need to Talk About Random Splits
Figure 4 for We Need to Talk About Random Splits
Viaarxiv icon