Yang Zhang

Prompt Stealing Attacks Against Large Language Models

Feb 20, 2024
Zeyang Sha, Yang Zhang

Via arXiv

Are LLM-based Evaluators Confusing NLG Quality Criteria?

Feb 19, 2024
Xinyu Hu, Mingqi Gao, Sen Hu, Yang Zhang, Yicheng Chen, Teng Xu, Xiaojun Wan

Via arXiv

Rapid Adoption, Hidden Risks: The Dual Impact of Large Language Model Customization

Feb 15, 2024
Rui Zhang, Hongwei Li, Rui Wen, Wenbo Jiang, Yuan Zhang, Michael Backes, Yun Shen, Yang Zhang

Via arXiv

Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection

Feb 14, 2024
Pengfei Zhou, Weiqing Min, Jiajun Song, Yang Zhang, Shuqiang Jiang

Via arXiv

PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Preference Alignment

Feb 13, 2024
Yongchao Chen, Jacob Arkin, Yilun Hao, Yang Zhang, Nicholas Roy, Chuchu Fan

Via arXiv

Comprehensive Assessment of Jailbreak Attacks Against LLMs

Feb 08, 2024
Junjie Chu, Yugeng Liu, Ziqing Yang, Xinyue Shen, Michael Backes, Yang Zhang

Via arXiv

Accurate LoRA-Finetuning Quantization of LLMs via Information Retention

Feb 08, 2024
Haotong Qin, Xudong Ma, Xingyu Zheng, Xiaoyang Li, Yang Zhang, Shouda Liu, Jie Luo, Xianglong Liu, Michele Magno

Via arXiv

GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models

Feb 05, 2024
Haibo Jin, Ruoxi Chen, Andy Zhou, Jinyin Chen, Yang Zhang, Haohan Wang

Via arXiv

Conversation Reconstruction Attack Against GPT Models

Feb 05, 2024
Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang

Via arXiv

Secure Wireless Communication in Active RIS-Assisted DFRC System

Feb 03, 2024
Yang Zhang, Hong Ren, Cunhua Pan, Boshi Wang, Zhiyuan Yu, Ruisong Weng, Tuo Wu, Yongchao He

Via arXiv