
Jan Chojnacki

Interpretable Risk Mitigation in LLM Agent Systems

May 15, 2025
Via arXiv

NL-ITI: Optimizing Probing and Intervention for Improvement of ITI Method

Mar 27, 2024
Via arXiv