Picture for Matthew Kowal

Matthew Kowal

Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution

Add code
Feb 16, 2026
Viaarxiv icon

TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering

Add code
Feb 06, 2026
Viaarxiv icon

Interpreting Physics in Video World Models

Add code
Feb 04, 2026
Viaarxiv icon

Large language models can effectively convince people to believe conspiracies

Add code
Jan 08, 2026
Viaarxiv icon

Emergent Persuasion: Will LLMs Persuade Without Being Prompted?

Add code
Dec 20, 2025
Viaarxiv icon

Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models

Add code
Feb 18, 2025
Viaarxiv icon

Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment

Add code
Feb 06, 2025
Viaarxiv icon

Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models

Add code
Apr 10, 2024
Figure 1 for Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models
Figure 2 for Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models
Figure 3 for Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models
Figure 4 for Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models
Viaarxiv icon

Multi-modal News Understanding with Professionally Labelled Videos (ReutersViLNews)

Add code
Jan 23, 2024
Figure 1 for Multi-modal News Understanding with Professionally Labelled Videos (ReutersViLNews)
Figure 2 for Multi-modal News Understanding with Professionally Labelled Videos (ReutersViLNews)
Figure 3 for Multi-modal News Understanding with Professionally Labelled Videos (ReutersViLNews)
Figure 4 for Multi-modal News Understanding with Professionally Labelled Videos (ReutersViLNews)
Viaarxiv icon

Understanding Video Transformers via Universal Concept Discovery

Add code
Jan 19, 2024
Viaarxiv icon