Picture for Chirag Agarwal

Chirag Agarwal

Are Large Language Models Post Hoc Explainers?

Add code
Oct 10, 2023
Figure 1 for Are Large Language Models Post Hoc Explainers?
Figure 2 for Are Large Language Models Post Hoc Explainers?
Figure 3 for Are Large Language Models Post Hoc Explainers?
Figure 4 for Are Large Language Models Post Hoc Explainers?
Viaarxiv icon

On the Trade-offs between Adversarial Robustness and Actionable Explanations

Add code
Sep 28, 2023
Viaarxiv icon

Certifying LLM Safety against Adversarial Prompting

Add code
Sep 06, 2023
Figure 1 for Certifying LLM Safety against Adversarial Prompting
Figure 2 for Certifying LLM Safety against Adversarial Prompting
Figure 3 for Certifying LLM Safety against Adversarial Prompting
Figure 4 for Certifying LLM Safety against Adversarial Prompting
Viaarxiv icon

Counterfactual Explanation Policies in RL

Add code
Jul 25, 2023
Viaarxiv icon

Explaining RL Decisions with Trajectories

Add code
May 06, 2023
Figure 1 for Explaining RL Decisions with Trajectories
Figure 2 for Explaining RL Decisions with Trajectories
Figure 3 for Explaining RL Decisions with Trajectories
Figure 4 for Explaining RL Decisions with Trajectories
Viaarxiv icon

Explain like I am BM25: Interpreting a Dense Model's Ranked-List with a Sparse Approximation

Add code
Apr 25, 2023
Figure 1 for Explain like I am BM25: Interpreting a Dense Model's Ranked-List with a Sparse Approximation
Figure 2 for Explain like I am BM25: Interpreting a Dense Model's Ranked-List with a Sparse Approximation
Figure 3 for Explain like I am BM25: Interpreting a Dense Model's Ranked-List with a Sparse Approximation
Figure 4 for Explain like I am BM25: Interpreting a Dense Model's Ranked-List with a Sparse Approximation
Viaarxiv icon

DeAR: Debiasing Vision-Language Models with Additive Residuals

Add code
Mar 18, 2023
Viaarxiv icon

GNNDelete: A General Strategy for Unlearning in Graph Neural Networks

Add code
Feb 26, 2023
Viaarxiv icon

Towards Estimating Transferability using Hard Subsets

Add code
Jan 17, 2023
Figure 1 for Towards Estimating Transferability using Hard Subsets
Figure 2 for Towards Estimating Transferability using Hard Subsets
Figure 3 for Towards Estimating Transferability using Hard Subsets
Figure 4 for Towards Estimating Transferability using Hard Subsets
Viaarxiv icon

Towards Training GNNs using Explanation Directed Message Passing

Add code
Dec 01, 2022
Viaarxiv icon