Keegan Hines

Defending Against Indirect Prompt Injection Attacks With Spotlighting

Mar 20, 2024
Keegan Hines, Gary Lopez, Matthew Hall, Federico Zarfati, Yonatan Zunger, Emre Kiciman

Via arXiv

Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models

Dec 21, 2023
Jingwei Yi, Yueqi Xie, Bin Zhu, Keegan Hines, Emre Kiciman, Guangzhong Sun, Xing Xie, Fangzhao Wu

Via arXiv

Reckoning with the Disagreement Problem: Explanation Consensus as a Training Objective

Mar 23, 2023
Avi Schwarzschild, Max Cembalest, Karthik Rao, Keegan Hines, John Dickerson

Via arXiv

Achieving Downstream Fairness with Geometric Repair

Mar 14, 2022
Kweku Kwegyir-Aggrey, Jessica Dai, John Dickerson, Keegan Hines

Via arXiv

Counterfactual Explanations for Machine Learning: Challenges Revisited

Jun 14, 2021
Sahil Verma, John Dickerson, Keegan Hines

Via arXiv

Amortized Generation of Sequential Counterfactual Explanations for Black-box Models

Jun 07, 2021
Sahil Verma, Keegan Hines, John P. Dickerson

Via arXiv

Counterfactual Explanations for Machine Learning: A Review

Oct 20, 2020
Sahil Verma, John Dickerson, Keegan Hines

Via arXiv

Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools

Sep 03, 2019
Anh Truong, Austin Walters, Jeremy Goodsitt, Keegan Hines, C. Bayan Bruss, Reza Farivar

Via arXiv