Alert button
Picture for Yang Liu

Yang Liu

Alert button

Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One

Add code
Bookmark button
Alert button
Feb 19, 2024
Tianlin Li, Xiaoyu Zhang, Chao Du, Tianyu Pang, Qian Liu, Qing Guo, Chao Shen, Yang Liu

Viaarxiv icon

Purifying Large Language Models by Ensembling a Small Language Model

Add code
Bookmark button
Alert button
Feb 19, 2024
Tianlin Li, Qian Liu, Tianyu Pang, Chao Du, Qing Guo, Yang Liu, Min Lin

Viaarxiv icon

Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement

Add code
Bookmark button
Alert button
Feb 19, 2024
Zijun Liu, Boqun Kou, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu

Viaarxiv icon

Groot: Adversarial Testing for Generative Text-to-Image Models with Tree-based Semantic Transformation

Add code
Bookmark button
Alert button
Feb 19, 2024
Yi Liu, Guowei Yang, Gelei Deng, Feiyue Chen, Yuqi Chen, Ling Shi, Tianwei Zhang, Yang Liu

Viaarxiv icon

Scaffolding Coordinates to Promote Vision-Language Coordination in Large Multi-Modal Models

Add code
Bookmark button
Alert button
Feb 19, 2024
Xuanyu Lei, Zonghan Yang, Xinrui Chen, Peng Li, Yang Liu

Viaarxiv icon

Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues

Add code
Bookmark button
Alert button
Feb 16, 2024
Zhiyuan Chang, Mingyang Li, Yi Liu, Junjie Wang, Qing Wang, Yang Liu

Viaarxiv icon

Adversarial Curriculum Graph Contrastive Learning with Pair-wise Augmentation

Add code
Bookmark button
Alert button
Feb 16, 2024
Xinjian Zhao, Liang Zhang, Yang Liu, Ruocheng Guo, Xiangyu Zhao

Viaarxiv icon

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting

Add code
Bookmark button
Alert button
Feb 16, 2024
Jiaheng Wei, Yuanshun Yao, Jean-Francois Ton, Hongyi Guo, Andrew Estornell, Yang Liu

Viaarxiv icon

Rethinking Machine Unlearning for Large Language Models

Add code
Bookmark button
Alert button
Feb 15, 2024
Sijia Liu, Yuanshun Yao, Jinghan Jia, Stephen Casper, Nathalie Baracaldo, Peter Hase, Xiaojun Xu, Yuguang Yao, Hang Li, Kush R. Varshney, Mohit Bansal, Sanmi Koyejo, Yang Liu

Viaarxiv icon

Towards Unified Alignment Between Agents, Humans, and Environment

Add code
Bookmark button
Alert button
Feb 14, 2024
Zonghan Yang, An Liu, Zijun Liu, Kaiming Liu, Fangzhou Xiong, Yile Wang, Zeyuan Yang, Qingyuan Hu, Xinrui Chen, Zhenhe Zhang, Fuwen Luo, Zhicheng Guo, Peng Li, Yang Liu

Viaarxiv icon