Finale Doshi-Velez

Non-Stationary Latent Auto-Regressive Bandits

Feb 05, 2024
Anna L. Trella, Walter Dempsey, Finale Doshi-Velez, Susan A. Murphy

Via arXiv

Semi-parametric Expert Bayesian Network Learning with Gaussian Processes and Horseshoe Priors

Jan 29, 2024
Yidou Weng, Finale Doshi-Velez

Via arXiv

Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks

Jan 26, 2024
Eura Nofshin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

Via arXiv

Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping

Dec 18, 2023
Lauren H. Cooke, Harvey Klyne, Edwin Zhang, Cassidy Laidlaw, Milind Tambe, Finale Doshi-Velez

Via arXiv

Signature Activation: A Sparse Signal View for Holistic Saliency

Sep 20, 2023
Jose Roberto Tello Ayala, Akl C. Fahed, Weiwei Pan, Eugene V. Pomerantsev, Patrick T. Ellinor, Anthony Philippakis, Finale Doshi-Velez

Via arXiv

Why do universal adversarial attacks work on large language models?: Geometry might be the answer

Sep 01, 2023
Varshini Subhash, Anna Bialas, Weiwei Pan, Finale Doshi-Velez

Via arXiv

Bayesian Inverse Transition Learning for Offline Settings

Aug 09, 2023
Leo Benac, Sonali Parbhoo, Finale Doshi-Velez

Via arXiv

SAP-sLDA: An Interpretable Interface for Exploring Unstructured Text

Jul 28, 2023
Charumathi Badrinath, Weiwei Pan, Finale Doshi-Velez

Via arXiv

On the Effective Horizon of Inverse Reinforcement Learning

Jul 13, 2023
Yiqing Xu, Finale Doshi-Velez, David Hsu

Via arXiv

Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities

Jun 22, 2023
Xudong Shen, Hannah Brown, Jiashu Tao, Martin Strobel, Yao Tong, Akshay Narayan, Harold Soh, Finale Doshi-Velez

Via arXiv