Xiaodong Liu

SWEA: Changing Factual Knowledge in Large Language Models via Subject Word Embedding Altering

Jan 31, 2024
Xiaopeng Li, Shasha Li, Bin Ji, Shezheng Song, Xi Wang, Jun Ma, Jie Yu, Xiaodong Liu, Jing Wang, Weimin Zhang

Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning

Jan 25, 2024
Yanda Chen, Chandan Singh, Xiaodong Liu, Simiao Zuo, Bin Yu, He He, Jianfeng Gao

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Jan 05, 2024
DeepSeek-AI: Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, Guowei Li, Jiashi Li, Yao Li, Y. K. Li, Wenfeng Liang, Fangyun Lin, A. X. Liu, Bo Liu, Wen Liu, Xiaodong Liu, Xin Liu, Yiyuan Liu, Haoyu Lu, Shanghao Lu, Fuli Luo, Shirong Ma, Xiaotao Nie, Tian Pei, Yishi Piao, Junjie Qiu, Hui Qu, Tongzheng Ren, Zehui Ren, Chong Ruan, Zhangli Sha, Zhihong Shao, Junxiao Song, Xuecheng Su, Jingxiang Sun, Yaofeng Sun, Minghui Tang, Bingxuan Wang, Peiyi Wang, Shiyu Wang, Yaohui Wang, Yongji Wang, Tong Wu, Y. Wu, Xin Xie, Zhenda Xie, Ziwei Xie, Yiliang Xiong, Hanwei Xu, R. X. Xu, Yanhong Xu, Dejian Yang, Yuxiang You, Shuiping Yu, Xingkai Yu, B. Zhang, Haowei Zhang, Lecong Zhang, Liyue Zhang, Mingchuan Zhang, Minghua Zhang, Wentao Zhang, Yichao Zhang, Chenggang Zhao, Yao Zhao, Shangyan Zhou, Shunfeng Zhou, Qihao Zhu, Yuheng Zou

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Nov 03, 2023
Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao

Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks

Oct 19, 2023
Xiaodong Yu, Hao Cheng, Xiaodong Liu, Dan Roth, Jianfeng Gao

Fast-ELECTRA for Efficient Pre-training

Oct 11, 2023
Chengyu Dong, Liyuan Liu, Hao Cheng, Jingbo Shang, Jianfeng Gao, Xiaodong Liu

Decentralized Multi-agent Reinforcement Learning based State-of-Charge Balancing Strategy for Distributed Energy Storage System

Aug 29, 2023
Zheng Xiong, Biao Luo, Bing-Chuan Wang, Xiaodong Xu, Xiaodong Liu, Tingwen Huang

Augmenting Language Models with Long-Term Memory

Jun 12, 2023
Weizhi Wang, Li Dong, Hao Cheng, Xiaodong Liu, Xifeng Yan, Jianfeng Gao, Furu Wei

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding

May 23, 2023
Yu Zhang, Hao Cheng, Zhihong Shen, Xiaodong Liu, Ye-Yi Wang, Jianfeng Gao

Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers

May 21, 2023
Linyuan Gong, Chenyan Xiong, Xiaodong Liu, Payal Bajaj, Yiqing Xie, Alvin Cheung, Jianfeng Gao, Xia Song
