Finale Doshi-Velez

Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks

Jan 26, 2024
Via arXiv

Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping

Dec 18, 2023
Via arXiv

Signature Activation: A Sparse Signal View for Holistic Saliency

Sep 20, 2023
Via arXiv

Why do universal adversarial attacks work on large language models?: Geometry might be the answer

Sep 01, 2023
Via arXiv

Bayesian Inverse Transition Learning for Offline Settings

Aug 09, 2023
Via arXiv

SAP-sLDA: An Interpretable Interface for Exploring Unstructured Text

Jul 28, 2023
Via arXiv

On the Effective Horizon of Inverse Reinforcement Learning

Jul 13, 2023
Via arXiv

Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities

Jun 22, 2023
Via arXiv

The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning

Jun 20, 2023
Via arXiv

Adaptive interventions for both accuracy and time in AI-assisted human decision making

Jun 12, 2023
Via arXiv