Sidharth Baskaran

Anatomy of Post-Training: Using Interpretability to Characterize Data and Shape the Learning Signal

Jun 10, 2026
Via arXiv

HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks

Mar 13, 2025
Via arXiv