David Krueger

Towards Out-of-Distribution Adversarial Robustness

Oct 10, 2022
Adam Ibrahim, Charles Guille-Escuret, Ioannis Mitliagkas, Irina Rish, David Krueger, Pouya Bashivan

Defining and Characterizing Reward Hacking

Sep 27, 2022
Joar Skalse, Nikolaus H. R. Howe, Dmitrii Krasheninnikov, David Krueger

Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics

Sep 20, 2022
Shoaib Ahmed Siddiqui, Nitarshan Rajkumar, Tegan Maharaj, David Krueger, Sara Hooker

Multi-Domain Balanced Sampling Improves Out-of-Distribution Generalization of Chest X-ray Pathology Prediction Models

Dec 28, 2021
Enoch Tetteh, Joseph Viviano, Yoshua Bengio, David Krueger, Joseph Paul Cohen

Multi-Domain Balanced Sampling Improves Out-of-Generalization of Chest X-ray Pathology Prediction Models

Dec 27, 2021
Enoch Tetteh, Joseph Viviano, Yoshua Bengio, David Krueger, Joseph Paul Cohen

Filling gaps in trustworthy development of AI

Dec 14, 2021
Shahar Avin, Haydn Belfield, Miles Brundage, Gretchen Krueger, Jasmine Wang, Adrian Weller, Markus Anderljung, Igor Krawczuk, David Krueger, Jonathan Lebensold, Tegan Maharaj, Noa Zilberman

Active Reinforcement Learning: Observing Rewards at a Cost

Nov 24, 2020
David Krueger, Jan Leike, Owain Evans, John Salvatier

Hidden Incentives for Auto-Induced Distributional Shift

Sep 19, 2020
David Krueger, Tegan Maharaj, Jan Leike

AI Research Considerations for Human Existential Safety (ARCHES)

May 30, 2020
Andrew Critch, David Krueger
