Damiano Fornasiere

Language models recognize dropout and Gaussian noise applied to their activations

Apr 19, 2026
Via arXiv

Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?

Feb 21, 2025
Via arXiv

Can a Bayesian Oracle Prevent Harm from an Agent?

Aug 09, 2024
Via arXiv