Alert button
Picture for Zhihao Xu

Zhihao Xu

Alert button

Uncovering Safety Risks in Open-source LLMs through Concept Activation Vector

Add code
Bookmark button
Alert button
Apr 18, 2024
Zhihao Xu, Ruixuan Huang, Xiting Wang, Fangzhao Wu, Jing Yao, Xing Xie

Viaarxiv icon