Picture for Vladimir Mikulik

Vladimir Mikulik

Alignment of Language Agents

Add code
Mar 26, 2021
Viaarxiv icon

Causal Analysis of Agent Behavior for AI Safety

Add code
Mar 05, 2021
Figure 1 for Causal Analysis of Agent Behavior for AI Safety
Figure 2 for Causal Analysis of Agent Behavior for AI Safety
Figure 3 for Causal Analysis of Agent Behavior for AI Safety
Figure 4 for Causal Analysis of Agent Behavior for AI Safety
Viaarxiv icon

Algorithms for Causal Reasoning in Probability Trees

Add code
Nov 12, 2020
Figure 1 for Algorithms for Causal Reasoning in Probability Trees
Figure 2 for Algorithms for Causal Reasoning in Probability Trees
Figure 3 for Algorithms for Causal Reasoning in Probability Trees
Figure 4 for Algorithms for Causal Reasoning in Probability Trees
Viaarxiv icon

Meta-trained agents implement Bayes-optimal agents

Add code
Oct 21, 2020
Figure 1 for Meta-trained agents implement Bayes-optimal agents
Figure 2 for Meta-trained agents implement Bayes-optimal agents
Figure 3 for Meta-trained agents implement Bayes-optimal agents
Figure 4 for Meta-trained agents implement Bayes-optimal agents
Viaarxiv icon

Neural networks are a priori biased towards Boolean functions with low entropy

Add code
Sep 29, 2019
Figure 1 for Neural networks are a priori biased towards Boolean functions with low entropy
Figure 2 for Neural networks are a priori biased towards Boolean functions with low entropy
Figure 3 for Neural networks are a priori biased towards Boolean functions with low entropy
Figure 4 for Neural networks are a priori biased towards Boolean functions with low entropy
Viaarxiv icon

Risks from Learned Optimization in Advanced Machine Learning Systems

Add code
Jun 11, 2019
Figure 1 for Risks from Learned Optimization in Advanced Machine Learning Systems
Figure 2 for Risks from Learned Optimization in Advanced Machine Learning Systems
Figure 3 for Risks from Learned Optimization in Advanced Machine Learning Systems
Viaarxiv icon