Mor Geva

Shammie

Indications of Belief-Guided Agency and Meta-Cognitive Monitoring in Large Language Models

Feb 02, 2026

From Directions to Regions: Decomposing Activations in Language Models via Local Geometry

Feb 02, 2026

Rethinking Selective Knowledge Distillation

Feb 01, 2026

Detecting (Un)answerability in Large Language Models with Linear Directions

Sep 26, 2025

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Sep 03, 2025

How Well Can Reasoning Models Identify and Recover from Unhelpful Thoughts?

Jun 12, 2025

Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization

Jun 12, 2025

Precise In-Parameter Concept Erasure in Large Language Models

May 28, 2025

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

Mar 04, 2025

Preventing Rogue Agents Improves Multi-Agent Collaboration

Feb 09, 2025