Alert button
Picture for Xiao Liu

Xiao Liu

Alert button

Rho-1: Not All Tokens Are What You Need

Add code
Bookmark button
Alert button
Apr 11, 2024
Zhenghao Lin, Zhibin Gou, Yeyun Gong, Xiao Liu, Yelong Shen, Ruochen Xu, Chen Lin, Yujiu Yang, Jian Jiao, Nan Duan, Weizhu Chen

Viaarxiv icon

AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent

Add code
Bookmark button
Alert button
Apr 04, 2024
Hanyu Lai, Xiao Liu, Iat Long Iong, Shuntian Yao, Yuxuan Chen, Pengbo Shen, Hao Yu, Hanchen Zhang, Xiaohan Zhang, Yuxiao Dong, Jie Tang

Viaarxiv icon

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Add code
Bookmark button
Alert button
Apr 03, 2024
Yifan Xu, Xiao Liu, Xinghan Liu, Zhenyu Hou, Yueyan Li, Xiaohan Zhang, Zihan Wang, Aohan Zeng, Zhengxiao Du, Wenyi Zhao, Jie Tang, Yuxiao Dong

Viaarxiv icon

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback

Add code
Bookmark button
Alert button
Apr 03, 2024
Zhenyu Hou, Yilin Niu, Zhengxiao Du, Xiaohan Zhang, Xiao Liu, Aohan Zeng, Qinkai Zheng, Minlie Huang, Hongning Wang, Jie Tang, Yuxiao Dong

Viaarxiv icon

Extensive Self-Contrast Enables Feedback-Free Language Model Alignment

Add code
Bookmark button
Alert button
Mar 31, 2024
Xiao Liu, Xixuan Song, Yuxiao Dong, Jie Tang

Viaarxiv icon

GPTA: Generative Prompt Tuning Assistant for Synergistic Downstream Neural Network Enhancement with LLMs

Add code
Bookmark button
Alert button
Mar 29, 2024
Xiao Liu, Jiawei Zhang

Viaarxiv icon

Can multiple-choice questions really be useful in detecting the abilities of LLMs?

Add code
Bookmark button
Alert button
Mar 28, 2024
Wangyue Li, Liangzhi Li, Tong Xiang, Xiao Liu, Wei Deng, Noa Garcia

Viaarxiv icon

iRoCo: Intuitive Robot Control From Anywhere Using a Smartwatch

Add code
Bookmark button
Alert button
Mar 11, 2024
Fabian C Weigend, Xiao Liu, Shubham Sonawani, Neelesh Kumar, Venugopal Vasudevan, Heni Ben Amor

Figure 1 for iRoCo: Intuitive Robot Control From Anywhere Using a Smartwatch
Figure 2 for iRoCo: Intuitive Robot Control From Anywhere Using a Smartwatch
Figure 3 for iRoCo: Intuitive Robot Control From Anywhere Using a Smartwatch
Figure 4 for iRoCo: Intuitive Robot Control From Anywhere Using a Smartwatch
Viaarxiv icon

Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization

Add code
Bookmark button
Alert button
Mar 05, 2024
Yuan Lin, Antai Xie, Xiao Liu

Figure 1 for Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization
Figure 2 for Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization
Figure 3 for Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization
Figure 4 for Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization
Viaarxiv icon

Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning

Add code
Bookmark button
Alert button
Mar 04, 2024
Yiming Huang, Xiao Liu, Yeyun Gong, Zhibin Gou, Yelong Shen, Nan Duan, Weizhu Chen

Figure 1 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 2 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 3 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 4 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Viaarxiv icon