David Manheim

Agents of Chaos

Feb 23, 2026
Via arXiv

The Necessity of AI Audit Standards Boards

Apr 11, 2024
Via arXiv

Modeling Transformative AI Risks (MTAIR) Project -- Summary Report

Jun 19, 2022
Via arXiv

Arguments about Highly Reliable Agent Designs as a Useful Path to Artificial Intelligence Safety

Jan 09, 2022
Via arXiv

Forecasting AI Progress: A Research Agenda

Aug 04, 2020
Via arXiv

Oversight of Unsafe Systems via Dynamic Safety Envelopes

Nov 22, 2018
Via arXiv

Overoptimization Failures and Specification Gaming in Multi-agent Systems

Oct 31, 2018
Via arXiv

Categorizing Variants of Goodhart's Law

Apr 09, 2018
Via arXiv