Mor Geva

Shammie

Don't Forget Your Embeddings: Robust Knowledge Erasure via Precise Editing of Embeddings

Jun 02, 2026
Via arXiv

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

May 24, 2026
Via arXiv

Routers Learn the Geometry of Their Experts: Geometric Coupling in Sparse Mixture-of-Experts

May 12, 2026
Via arXiv

Disentangling MLP Neuron Weights in Vocabulary Space

Apr 07, 2026
Via arXiv

Friends and Grandmothers in Silico: Localizing Entity Cells in Language Models

Apr 01, 2026
Via arXiv

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Mar 10, 2026
Via arXiv

Latent Reasoning with Supervised Thinking States

Feb 09, 2026
Via arXiv

Indications of Belief-Guided Agency and Meta-Cognitive Monitoring in Large Language Models

Feb 02, 2026
Via arXiv

From Directions to Regions: Decomposing Activations in Language Models via Local Geometry

Feb 02, 2026
Via arXiv

Rethinking Selective Knowledge Distillation

Feb 01, 2026
Via arXiv