Picture for Rui Pu

Rui Pu

How Real is Your Jailbreak? Fine-grained Jailbreak Evaluation with Anchored Reference

Add code
Jan 04, 2026
Viaarxiv icon

LANCET: Neural Intervention via Structural Entropy for Mitigating Faithfulness Hallucinations in LLMs

Add code
Jan 04, 2026
Viaarxiv icon

MirrorGuard: Adaptive Defense Against Jailbreaks via Entropy-Guided Mirror Crafting

Add code
Mar 17, 2025
Viaarxiv icon

Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs

Add code
Oct 18, 2024
Figure 1 for Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Figure 2 for Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Figure 3 for Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Figure 4 for Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Viaarxiv icon