A Chinese Dataset for Evaluating the Safeguards in Large Language Models

Feb 19, 2024
Yuxia Wang, Zenan Zhai, Haonan Li, Xudong Han, Lizhi Lin, Zhenxuan Zhang, Jingru Zhao, Preslav Nakov, Timothy Baldwin
