Picture for Alexander Boyd

Alexander Boyd

AI in a vat: Fundamental limits of efficient world modelling for agent sandboxing and interpretability

Add code
Apr 06, 2025
Figure 1 for AI in a vat: Fundamental limits of efficient world modelling for agent sandboxing and interpretability
Figure 2 for AI in a vat: Fundamental limits of efficient world modelling for agent sandboxing and interpretability
Figure 3 for AI in a vat: Fundamental limits of efficient world modelling for agent sandboxing and interpretability
Figure 4 for AI in a vat: Fundamental limits of efficient world modelling for agent sandboxing and interpretability
Viaarxiv icon