Picture for Jacob Drori

Jacob Drori

Michael Pokorny

Recontextualization Mitigates Specification Gaming without Modifying the Specification

Add code
Dec 22, 2025
Viaarxiv icon

Humanity's Last Exam

Add code
Jan 24, 2025
Viaarxiv icon

Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations

Add code
Oct 09, 2024
Figure 1 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Figure 2 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Figure 3 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Figure 4 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Viaarxiv icon