Alert button
Picture for Michiel A. Bakker

Michiel A. Bakker

Alert button

Fine-tuning language models to find agreement among humans with diverse preferences

Nov 28, 2022
Michiel A. Bakker, Martin J. Chadwick, Hannah R. Sheahan, Michael Henry Tessler, Lucy Campbell-Gillingham, Jan Balaguer, Nat McAleese, Amelia Glaese, John Aslanides, Matthew M. Botvinick, Christopher Summerfield

Figure 1 for Fine-tuning language models to find agreement among humans with diverse preferences
Figure 2 for Fine-tuning language models to find agreement among humans with diverse preferences
Figure 3 for Fine-tuning language models to find agreement among humans with diverse preferences
Figure 4 for Fine-tuning language models to find agreement among humans with diverse preferences
Viaarxiv icon

Statistical discrimination in learning agents

Oct 21, 2021
Edgar A. Duéñez-Guzmán, Kevin R. McKee, Yiran Mao, Ben Coppin, Silvia Chiappa, Alexander Sasha Vezhnevets, Michiel A. Bakker, Yoram Bachrach, Suzanne Sadedin, William Isaac, Karl Tuyls, Joel Z. Leibo

Figure 1 for Statistical discrimination in learning agents
Figure 2 for Statistical discrimination in learning agents
Figure 3 for Statistical discrimination in learning agents
Figure 4 for Statistical discrimination in learning agents
Viaarxiv icon

Modelling Cooperation in Network Games with Spatio-Temporal Complexity

Feb 13, 2021
Michiel A. Bakker, Richard Everett, Laura Weidinger, Iason Gabriel, William S. Isaac, Joel Z. Leibo, Edward Hughes

Figure 1 for Modelling Cooperation in Network Games with Spatio-Temporal Complexity
Figure 2 for Modelling Cooperation in Network Games with Spatio-Temporal Complexity
Figure 3 for Modelling Cooperation in Network Games with Spatio-Temporal Complexity
Figure 4 for Modelling Cooperation in Network Games with Spatio-Temporal Complexity
Viaarxiv icon

DADI: Dynamic Discovery of Fair Information with Adversarial Reinforcement Learning

Oct 30, 2019
Michiel A. Bakker, Duy Patrick Tu, Humberto Riverón Valdés, Krishna P. Gummadi, Kush R. Varshney, Adrian Weller, Alex Pentland

Figure 1 for DADI: Dynamic Discovery of Fair Information with Adversarial Reinforcement Learning
Figure 2 for DADI: Dynamic Discovery of Fair Information with Adversarial Reinforcement Learning
Viaarxiv icon

VizML: A Machine Learning Approach to Visualization Recommendation

Aug 14, 2018
Kevin Z. Hu, Michiel A. Bakker, Stephen Li, Tim Kraska, César A. Hidalgo

Figure 1 for VizML: A Machine Learning Approach to Visualization Recommendation
Figure 2 for VizML: A Machine Learning Approach to Visualization Recommendation
Figure 3 for VizML: A Machine Learning Approach to Visualization Recommendation
Figure 4 for VizML: A Machine Learning Approach to Visualization Recommendation
Viaarxiv icon