Picture for Galvin Khara

Galvin Khara

Transformers Don't Need LayerNorm at Inference Time: Scaling LayerNorm Removal to GPT-2 XL and the Implications for Mechanistic Interpretability

Add code
Jul 03, 2025
Viaarxiv icon

Robust image representations with counterfactual contrastive learning

Add code
Sep 16, 2024
Figure 1 for Robust image representations with counterfactual contrastive learning
Figure 2 for Robust image representations with counterfactual contrastive learning
Figure 3 for Robust image representations with counterfactual contrastive learning
Figure 4 for Robust image representations with counterfactual contrastive learning
Viaarxiv icon

Counterfactual contrastive learning: robust representations via causal image synthesis

Add code
Mar 14, 2024
Viaarxiv icon