Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Discovering Agents


Aug 24, 2022
Zachary Kenton, Ramana Kumar, Sebastian Farquhar, Jonathan Richens, Matt MacDermott, Tom Everitt

* Some typos corrected 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Path-Specific Objectives for Safer Agent Incentives


Apr 21, 2022
Sebastian Farquhar, Ryan Carey, Tom Everitt

* Presented at AAAI 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

A Complete Criterion for Value of Information in Soluble Influence Diagrams


Feb 23, 2022
Chris van Merwijk, Ryan Carey, Tom Everitt

* In Proceedings of the AAAI 2022 Conference 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Why Fair Labels Can Yield Unfair Predictions: Graphical Conditions for Introduced Unfairness


Feb 23, 2022
Carolyn Ashurst, Ryan Carey, Silvia Chiappa, Tom Everitt

* In Proceedings of the AAAI 2022 Conference 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Shaking the foundations: delusions in sequence models for interaction and control


Oct 20, 2021
Pedro A. Ortega, Markus Kunesch, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Joel Veness, Jonas Buchli, Jonas Degrave, Bilal Piot, Julien Perolat, Tom Everitt, Corentin Tallec, Emilio Parisotto, Tom Erez, Yutian Chen, Scott Reed, Marcus Hutter, Nando de Freitas, Shane Legg

* DeepMind Tech Report, 16 pages, 4 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Alignment of Language Agents


Mar 26, 2021
Zachary Kenton, Tom Everitt, Laura Weidinger, Iason Gabriel, Vladimir Mikulik, Geoffrey Irving


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

How RL Agents Behave When Their Actions Are Modified


Feb 15, 2021
Eric D. Langlois, Tom Everitt

* 10 pages (+6 appendix); 5 figures. Published in the AAAI 2021 Conference. Code is available at https://github.com/edlanglois/mamdp 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice


Feb 09, 2021
Lewis Hammond, James Fox, Tom Everitt, Alessandro Abate, Michael Wooldridge

* Accepted to the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-21) 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Agent Incentives: A Causal Perspective


Feb 02, 2021
Tom Everitt, Ryan Carey, Eric Langlois, Pedro A Ortega, Shane Legg

* In Proceedings of the AAAI 2021 Conference. Supersedes arXiv:1902.09980, arXiv:2001.07118 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Avoiding Tampering Incentives in Deep RL via Decoupled Approval


Nov 17, 2020
Jonathan Uesato, Ramana Kumar, Victoria Krakovna, Tom Everitt, Richard Ngo, Shane Legg


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
>>