Huizhen Shu

Layer-Wise Perturbations via Sparse Autoencoders for Adversarial Text Generation

Aug 14, 2025
Via arXiv