Alert button
Picture for Yonadav Shavit

Yonadav Shavit

Alert button

Frontier AI Regulation: Managing Emerging Risks to Public Safety

Add code
Bookmark button
Alert button
Jul 11, 2023
Markus Anderljung, Joslyn Barnhart, Anton Korinek, Jade Leung, Cullen O'Keefe, Jess Whittlestone, Shahar Avin, Miles Brundage, Justin Bullock, Duncan Cass-Beggs, Ben Chang, Tantum Collins, Tim Fist, Gillian Hadfield, Alan Hayes, Lewis Ho, Sara Hooker, Eric Horvitz, Noam Kolt, Jonas Schuett, Yonadav Shavit, Divya Siddarth, Robert Trager, Kevin Wolf

Figure 1 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Figure 2 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Figure 3 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Figure 4 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Viaarxiv icon

Tools for Verifying Neural Models' Training Data

Add code
Bookmark button
Alert button
Jul 02, 2023
Dami Choi, Yonadav Shavit, David Duvenaud

Figure 1 for Tools for Verifying Neural Models' Training Data
Figure 2 for Tools for Verifying Neural Models' Training Data
Figure 3 for Tools for Verifying Neural Models' Training Data
Figure 4 for Tools for Verifying Neural Models' Training Data
Viaarxiv icon

What does it take to catch a Chinchilla? Verifying Rules on Large-Scale Neural Network Training via Compute Monitoring

Add code
Bookmark button
Alert button
Mar 20, 2023
Yonadav Shavit

Figure 1 for What does it take to catch a Chinchilla? Verifying Rules on Large-Scale Neural Network Training via Compute Monitoring
Figure 2 for What does it take to catch a Chinchilla? Verifying Rules on Large-Scale Neural Network Training via Compute Monitoring
Figure 3 for What does it take to catch a Chinchilla? Verifying Rules on Large-Scale Neural Network Training via Compute Monitoring
Viaarxiv icon

Strengthening Subcommunities: Towards Sustainable Growth in AI Research

Add code
Bookmark button
Alert button
Apr 18, 2022
Andi Peng, Jessica Zosa Forde, Yonadav Shavit, Jonathan Frankle

Viaarxiv icon

Learning From Strategic Agents: Accuracy, Improvement, and Causality

Add code
Bookmark button
Alert button
Feb 24, 2020
Yonadav Shavit, Benjamin Edelman, Brian Axelrod

Figure 1 for Learning From Strategic Agents: Accuracy, Improvement, and Causality
Viaarxiv icon

Extracting Incentives from Black-Box Decisions

Add code
Bookmark button
Alert button
Oct 13, 2019
Yonadav Shavit, William S. Moses

Figure 1 for Extracting Incentives from Black-Box Decisions
Figure 2 for Extracting Incentives from Black-Box Decisions
Figure 3 for Extracting Incentives from Black-Box Decisions
Figure 4 for Extracting Incentives from Black-Box Decisions
Viaarxiv icon