Alert button
Picture for Pei Ke

Pei Ke

Alert button

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

Add code
Bookmark button
Alert button
Feb 26, 2024
Zhexin Zhang, Yida Lu, Jingyuan Ma, Di Zhang, Rui Li, Pei Ke, Hao Sun, Lei Sha, Zhifang Sui, Hongning Wang, Minlie Huang

Viaarxiv icon

Towards Efficient and Exact Optimization of Language Model Alignment

Add code
Bookmark button
Alert button
Feb 02, 2024
Haozhe Ji, Cheng Lu, Yilin Niu, Pei Ke, Hongning Wang, Jun Zhu, Jie Tang, Minlie Huang

Viaarxiv icon

AlignBench: Benchmarking Chinese Alignment of Large Language Models

Add code
Bookmark button
Alert button
Dec 05, 2023
Xiao Liu, Xuanyu Lei, Shengyuan Wang, Yue Huang, Zhuoer Feng, Bosi Wen, Jiale Cheng, Pei Ke, Yifan Xu, Weng Lam Tam, Xiaohan Zhang, Lichao Sun, Hongning Wang, Jing Zhang, Minlie Huang, Yuxiao Dong, Jie Tang

Viaarxiv icon

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation

Add code
Bookmark button
Alert button
Nov 30, 2023
Pei Ke, Bosi Wen, Zhuoer Feng, Xiao Liu, Xuanyu Lei, Jiale Cheng, Shengyuan Wang, Aohan Zeng, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang

Viaarxiv icon

Unveiling the Implicit Toxicity in Large Language Models

Add code
Bookmark button
Alert button
Nov 29, 2023
Jiaxin Wen, Pei Ke, Hao Sun, Zhexin Zhang, Chengfei Li, Jinfeng Bai, Minlie Huang

Viaarxiv icon

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

Add code
Bookmark button
Alert button
Nov 15, 2023
Zhexin Zhang, Junxiao Yang, Pei Ke, Minlie Huang

Viaarxiv icon

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

Add code
Bookmark button
Alert button
Nov 08, 2023
Jiale Cheng, Xiao Liu, Kehan Zheng, Pei Ke, Hongning Wang, Yuxiao Dong, Jie Tang, Minlie Huang

Viaarxiv icon

Language Model Decoding as Direct Metrics Optimization

Add code
Bookmark button
Alert button
Oct 02, 2023
Haozhe Ji, Pei Ke, Hongning Wang, Minlie Huang

Figure 1 for Language Model Decoding as Direct Metrics Optimization
Figure 2 for Language Model Decoding as Direct Metrics Optimization
Figure 3 for Language Model Decoding as Direct Metrics Optimization
Figure 4 for Language Model Decoding as Direct Metrics Optimization
Viaarxiv icon