Alert button
Picture for Zhanhui Zhou

Zhanhui Zhou

Alert button

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models

Add code
Bookmark button
Alert button
Feb 23, 2024
Yanan Wu, Jie Liu, Xingyuan Bu, Jiaheng Liu, Zhanhui Zhou, Yuanxing Zhang, Chenchen Zhang, Zhiqi Bai, Haibin Chen, Tiezheng Ge, Wanli Ouyang, Wenbo Su, Bo Zheng

Viaarxiv icon

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

Add code
Bookmark button
Alert button
Feb 22, 2024
Ge Bai, Jie Liu, Xingyuan Bu, Yancheng He, Jiaheng Liu, Zhanhui Zhou, Zhuoran Lin, Wenbo Su, Tiezheng Ge, Bo Zheng, Wanli Ouyang

Viaarxiv icon

Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!

Add code
Bookmark button
Alert button
Feb 21, 2024
Zhanhui Zhou, Jie Liu, Zhichen Dong, Jiaheng Liu, Chao Yang, Wanli Ouyang, Yu Qiao

Viaarxiv icon

Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey

Add code
Bookmark button
Alert button
Feb 14, 2024
Zhichen Dong, Zhanhui Zhou, Chao Yang, Jing Shao, Yu Qiao

Viaarxiv icon

Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models

Add code
Bookmark button
Alert button
Oct 17, 2023
Zhanhui Zhou, Jie Liu, Chao Yang, Jing Shao, Yu Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao

Viaarxiv icon

Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization

Add code
Bookmark button
Alert button
Oct 05, 2023
Zhanhui Zhou, Jie Liu, Chao Yang, Jing Shao, Yu Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao

Viaarxiv icon