Picture for Eric Steinberger

Eric Steinberger

Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety

Add code
Jul 15, 2025
Figure 1 for Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety
Viaarxiv icon

DREAM: Deep Regret minimization with Advantage baselines and Model-free learning

Add code
Jun 18, 2020
Figure 1 for DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Figure 2 for DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Figure 3 for DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Figure 4 for DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Viaarxiv icon

Single Deep Counterfactual Regret Minimization

Add code
Jan 22, 2019
Figure 1 for Single Deep Counterfactual Regret Minimization
Figure 2 for Single Deep Counterfactual Regret Minimization
Figure 3 for Single Deep Counterfactual Regret Minimization
Figure 4 for Single Deep Counterfactual Regret Minimization
Viaarxiv icon