Picture for Ritik

Ritik

Jailbreaking for the Average Jane: Choosing Optimal Jailbreaks via Bandit Algorithms for Automatically Enhanced Queries

Add code
Jun 25, 2026
Viaarxiv icon

Inducing Robustness in a 2 Dimensional Direct Preference Optimization Paradigm

Add code
May 03, 2025
Figure 1 for Inducing Robustness in a 2 Dimensional Direct Preference Optimization Paradigm
Figure 2 for Inducing Robustness in a 2 Dimensional Direct Preference Optimization Paradigm
Figure 3 for Inducing Robustness in a 2 Dimensional Direct Preference Optimization Paradigm
Figure 4 for Inducing Robustness in a 2 Dimensional Direct Preference Optimization Paradigm
Viaarxiv icon