Vikram Natarajan

Building Better Deception Probes Using Targeted Instruction Pairs

Feb 01, 2026
Via arXiv

Mechanistic Decomposition of Sentence Representations

Jun 04, 2025
Via arXiv