Picture for Luhan Mikaelson

Luhan Mikaelson

Beyond Mimicry: Preference Coherence in LLMs

Add code
Nov 17, 2025
Viaarxiv icon

Self-Ablating Transformers: More Interpretability, Less Sparsity

Add code
May 01, 2025
Figure 1 for Self-Ablating Transformers: More Interpretability, Less Sparsity
Figure 2 for Self-Ablating Transformers: More Interpretability, Less Sparsity
Figure 3 for Self-Ablating Transformers: More Interpretability, Less Sparsity
Figure 4 for Self-Ablating Transformers: More Interpretability, Less Sparsity
Viaarxiv icon