Picture for Shojiro Yamabe

Shojiro Yamabe

Understanding Sensitivity of Differential Attention through the Lens of Adversarial Robustness

Add code
Oct 01, 2025
Viaarxiv icon

Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability

Add code
Oct 01, 2025
Viaarxiv icon

MergePrint: Robust Fingerprinting against Merging Large Language Models

Add code
Oct 11, 2024
Figure 1 for MergePrint: Robust Fingerprinting against Merging Large Language Models
Figure 2 for MergePrint: Robust Fingerprinting against Merging Large Language Models
Figure 3 for MergePrint: Robust Fingerprinting against Merging Large Language Models
Figure 4 for MergePrint: Robust Fingerprinting against Merging Large Language Models
Viaarxiv icon

Behavior-Targeted Attack on Reinforcement Learning with Limited Access to Victim's Policy

Add code
Jun 06, 2024
Viaarxiv icon