Mor Geva

Shammie

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Mar 10, 2026

Latent Reasoning with Supervised Thinking States

Feb 09, 2026

From Directions to Regions: Decomposing Activations in Language Models via Local Geometry

Feb 02, 2026

Indications of Belief-Guided Agency and Meta-Cognitive Monitoring in Large Language Models

Feb 02, 2026

Rethinking Selective Knowledge Distillation

Feb 01, 2026

Detecting (Un)answerability in Large Language Models with Linear Directions

Sep 26, 2025

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Sep 03, 2025

Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization

Jun 12, 2025

How Well Can Reasoning Models Identify and Recover from Unhelpful Thoughts?

Jun 12, 2025

Precise In-Parameter Concept Erasure in Large Language Models

May 28, 2025