Alert button
Picture for Nick Gabrieli

Nick Gabrieli

Alert button

Safety Cases: How to Justify the Safety of Advanced AI Systems

Add code
Bookmark button
Alert button
Mar 18, 2024
Joshua Clymer, Nick Gabrieli, David Krueger, Thomas Larsen

Figure 1 for Safety Cases: How to Justify the Safety of Advanced AI Systems
Figure 2 for Safety Cases: How to Justify the Safety of Advanced AI Systems
Figure 3 for Safety Cases: How to Justify the Safety of Advanced AI Systems
Figure 4 for Safety Cases: How to Justify the Safety of Advanced AI Systems
Viaarxiv icon

Steering Llama 2 via Contrastive Activation Addition

Add code
Bookmark button
Alert button
Dec 09, 2023
Nina Rimsky, Nick Gabrieli, Julian Schulz, Meg Tong, Evan Hubinger, Alexander Matt Turner

Viaarxiv icon