Rita Sevastjanova

Concept-Level Explainability for Auditing & Steering LLM Responses

May 12, 2025
Via arXiv

LayerFlow: Layer-wise Exploration of LLM Embeddings using Uncertainty-aware Interlinked Projections

Apr 09, 2025
Via arXiv

Feature Clock: High-Dimensional Effects in Two-Dimensional Plots

Aug 06, 2024
Via arXiv

Challenges and Opportunities in Text Generation Explainability

May 14, 2024
Via arXiv

generAItor: Tree-in-the-Loop Text Generation for Language Model Explainability and Adaptation

Mar 12, 2024
Via arXiv

SyntaxShap: Syntax-aware Explainability Method for Text Generation

Feb 14, 2024
Via arXiv

Revealing the Unwritten: Visual Investigation of Beam Search Trees to Address Language Model Prompting Challenges

Oct 17, 2023
Via arXiv

Negation, Coordination, and Quantifiers in Contextualized Language Models

Sep 16, 2022
Via arXiv

Visual Comparison of Language Model Adaptation

Aug 17, 2022
Via arXiv

Beware the Rationalization Trap! When Language Model Explainability Diverges from our Mental Models of Language

Jul 14, 2022
Via arXiv