Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Vladimir Mikulik

Alignment of Language Agents


Mar 26, 2021
Zachary Kenton, Tom Everitt, Laura Weidinger, Iason Gabriel, Vladimir Mikulik, Geoffrey Irving


  Access Paper or Ask Questions

Causal Analysis of Agent Behavior for AI Safety


Mar 05, 2021
Grégoire Déletang, Jordi Grau-Moya, Miljan Martic, Tim Genewein, Tom McGrath, Vladimir Mikulik, Markus Kunesch, Shane Legg, Pedro A. Ortega

* 16 pages, 16 figures, 6 tables 

  Access Paper or Ask Questions

Algorithms for Causal Reasoning in Probability Trees


Nov 12, 2020
Tim Genewein, Tom McGrath, Grégoire Déletang, Vladimir Mikulik, Miljan Martic, Shane Legg, Pedro A. Ortega

* (2nd version with correction to algorithm) 11 pages, 8 figures, 5 algorithms. A companion Colaboratory tutorial is available at https://github.com/deepmind/deepmind-research/tree/master/causal_reasoning 

  Access Paper or Ask Questions

Meta-trained agents implement Bayes-optimal agents


Oct 21, 2020
Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro A. Ortega

* Published at 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada 

  Access Paper or Ask Questions

Neural networks are a priori biased towards Boolean functions with low entropy


Sep 29, 2019
Chris Mingard, Joar Skalse, Guillermo Valle-PĂ©rez, David MartĂ­nez-Rubio, Vladimir Mikulik, Ard A. Louis

* Under review as a conference paper at ICLR 2020 

  Access Paper or Ask Questions

Neural networks are $\textit{a priori}$ biased towards Boolean functions with low entropy


Sep 25, 2019
Chris Mingard, Joar Skalse, Guillermo Valle-PĂ©rez, David MartĂ­nez-Rubio, Vladimir Mikulik, Ard A. Louis

* Under review as a conference paper at ICLR 2020 

  Access Paper or Ask Questions

Risks from Learned Optimization in Advanced Machine Learning Systems


Jun 11, 2019
Evan Hubinger, Chris van Merwijk, Vladimir Mikulik, Joar Skalse, Scott Garrabrant


  Access Paper or Ask Questions