Picture for Danush Khanna

Danush Khanna

AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI)

Add code
Jun 11, 2025
Viaarxiv icon

SELF-PERCEPT: Introspection Improves Large Language Models' Detection of Multi-Person Mental Manipulation in Conversations

Add code
May 27, 2025
Viaarxiv icon

DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization

Add code
Jan 08, 2025
Figure 1 for DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization
Figure 2 for DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization
Figure 3 for DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization
Figure 4 for DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization
Viaarxiv icon

Do the Right Thing, Just Debias! Multi-Category Bias Mitigation Using LLMs

Add code
Sep 24, 2024
Figure 1 for Do the Right Thing, Just Debias! Multi-Category Bias Mitigation Using LLMs
Figure 2 for Do the Right Thing, Just Debias! Multi-Category Bias Mitigation Using LLMs
Figure 3 for Do the Right Thing, Just Debias! Multi-Category Bias Mitigation Using LLMs
Figure 4 for Do the Right Thing, Just Debias! Multi-Category Bias Mitigation Using LLMs
Viaarxiv icon

Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts

Add code
Jun 06, 2024
Viaarxiv icon