Alert button
Picture for Bo Li

Bo Li

Alert button

A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation

Feb 29, 2024
Hanxi Li, Zhengxun Zhang, Hao Chen, Lin Wu, Bo Li, Deyin Liu, Mingwen Wang

Viaarxiv icon

Mitigating Fine-tuning Jailbreak Attack with Backdoor Enhanced Alignment

Feb 27, 2024
Jiongxiao Wang, Jiazhao Li, Yiquan Li, Xiangyu Qi, Junjie Hu, Yixuan Li, Patrick McDaniel, Muhao Chen, Bo Li, Chaowei Xiao

Viaarxiv icon

DART: Depth-Enhanced Accurate and Real-Time Background Matting

Feb 24, 2024
Hanxi Li, Guofeng Li, Bo Li, Lin Wu, Yan Cheng

Viaarxiv icon

ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

Feb 22, 2024
Fengqing Jiang, Zhangchen Xu, Luyao Niu, Zhen Xiang, Bhaskar Ramasubramanian, Bo Li, Radha Poovendran

Viaarxiv icon

Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation

Feb 20, 2024
Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N. Sainath, Philip C. Woodland

Viaarxiv icon

C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models

Feb 12, 2024
Mintong Kang, Nezihe Merve Gürel, Ning Yu, Dawn Song, Bo Li

Viaarxiv icon

Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors

Feb 12, 2024
Dinuka Sahabandu, Xiaojun Xu, Arezoo Rajabi, Luyao Niu, Bhaskar Ramasubramanian, Bo Li, Radha Poovendran

Viaarxiv icon

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Feb 06, 2024
Mantas Mazeika, Long Phan, Xuwang Yin, Andy Zou, Zifan Wang, Norman Mu, Elham Sakhaee, Nathaniel Li, Steven Basart, Bo Li, David Forsyth, Dan Hendrycks

Viaarxiv icon