Alert button
Picture for Cassidy Laidlaw

Cassidy Laidlaw

Alert button

Preventing Reward Hacking with Occupancy Measure Regularization

Add code
Bookmark button
Alert button
Mar 05, 2024
Cassidy Laidlaw, Shivam Singhal, Anca Dragan

Figure 1 for Preventing Reward Hacking with Occupancy Measure Regularization
Figure 2 for Preventing Reward Hacking with Occupancy Measure Regularization
Figure 3 for Preventing Reward Hacking with Occupancy Measure Regularization
Figure 4 for Preventing Reward Hacking with Occupancy Measure Regularization
Viaarxiv icon

Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping

Add code
Bookmark button
Alert button
Dec 18, 2023
Lauren H. Cooke, Harvey Klyne, Edwin Zhang, Cassidy Laidlaw, Milind Tambe, Finale Doshi-Velez

Viaarxiv icon

The Effective Horizon Explains Deep RL Performance in Stochastic Environments

Add code
Bookmark button
Alert button
Dec 13, 2023
Cassidy Laidlaw, Banghua Zhu, Stuart Russell, Anca Dragan

Viaarxiv icon

Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF

Add code
Bookmark button
Alert button
Dec 13, 2023
Anand Siththaranjan, Cassidy Laidlaw, Dylan Hadfield-Menell

Viaarxiv icon

Bridging RL Theory and Practice with the Effective Horizon

Add code
Bookmark button
Alert button
Apr 19, 2023
Cassidy Laidlaw, Stuart Russell, Anca Dragan

Figure 1 for Bridging RL Theory and Practice with the Effective Horizon
Figure 2 for Bridging RL Theory and Practice with the Effective Horizon
Figure 3 for Bridging RL Theory and Practice with the Effective Horizon
Figure 4 for Bridging RL Theory and Practice with the Effective Horizon
Viaarxiv icon

The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models

Add code
Bookmark button
Alert button
Apr 22, 2022
Cassidy Laidlaw, Anca Dragan

Figure 1 for The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models
Figure 2 for The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models
Figure 3 for The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models
Figure 4 for The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models
Viaarxiv icon

Learning the Preferences of Uncertain Humans with Inverse Decision Theory

Add code
Bookmark button
Alert button
Jun 19, 2021
Cassidy Laidlaw, Stuart Russell

Figure 1 for Learning the Preferences of Uncertain Humans with Inverse Decision Theory
Figure 2 for Learning the Preferences of Uncertain Humans with Inverse Decision Theory
Figure 3 for Learning the Preferences of Uncertain Humans with Inverse Decision Theory
Figure 4 for Learning the Preferences of Uncertain Humans with Inverse Decision Theory
Viaarxiv icon

Perceptual Adversarial Robustness: Defense Against Unseen Threat Models

Add code
Bookmark button
Alert button
Jun 22, 2020
Cassidy Laidlaw, Sahil Singla, Soheil Feizi

Figure 1 for Perceptual Adversarial Robustness: Defense Against Unseen Threat Models
Figure 2 for Perceptual Adversarial Robustness: Defense Against Unseen Threat Models
Figure 3 for Perceptual Adversarial Robustness: Defense Against Unseen Threat Models
Figure 4 for Perceptual Adversarial Robustness: Defense Against Unseen Threat Models
Viaarxiv icon

Playing it Safe: Adversarial Robustness with an Abstain Option

Add code
Bookmark button
Alert button
Nov 25, 2019
Cassidy Laidlaw, Soheil Feizi

Figure 1 for Playing it Safe: Adversarial Robustness with an Abstain Option
Figure 2 for Playing it Safe: Adversarial Robustness with an Abstain Option
Figure 3 for Playing it Safe: Adversarial Robustness with an Abstain Option
Figure 4 for Playing it Safe: Adversarial Robustness with an Abstain Option
Viaarxiv icon

Functional Adversarial Attacks

Add code
Bookmark button
Alert button
May 29, 2019
Cassidy Laidlaw, Soheil Feizi

Figure 1 for Functional Adversarial Attacks
Figure 2 for Functional Adversarial Attacks
Figure 3 for Functional Adversarial Attacks
Figure 4 for Functional Adversarial Attacks
Viaarxiv icon