Picture for Xiao Liu

Xiao Liu

School of Computer Science and Technology, Anhui University

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback

Add code
Apr 03, 2024
Figure 1 for ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback
Figure 2 for ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback
Figure 3 for ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback
Figure 4 for ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback
Viaarxiv icon

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Add code
Apr 03, 2024
Figure 1 for ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
Figure 2 for ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
Figure 3 for ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
Figure 4 for ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
Viaarxiv icon

Extensive Self-Contrast Enables Feedback-Free Language Model Alignment

Add code
Mar 31, 2024
Viaarxiv icon

GPTA: Generative Prompt Tuning Assistant for Synergistic Downstream Neural Network Enhancement with LLMs

Add code
Mar 29, 2024
Viaarxiv icon

Can multiple-choice questions really be useful in detecting the abilities of LLMs?

Add code
Mar 28, 2024
Figure 1 for Can multiple-choice questions really be useful in detecting the abilities of LLMs?
Figure 2 for Can multiple-choice questions really be useful in detecting the abilities of LLMs?
Figure 3 for Can multiple-choice questions really be useful in detecting the abilities of LLMs?
Figure 4 for Can multiple-choice questions really be useful in detecting the abilities of LLMs?
Viaarxiv icon

iRoCo: Intuitive Robot Control From Anywhere Using a Smartwatch

Add code
Mar 11, 2024
Figure 1 for iRoCo: Intuitive Robot Control From Anywhere Using a Smartwatch
Figure 2 for iRoCo: Intuitive Robot Control From Anywhere Using a Smartwatch
Figure 3 for iRoCo: Intuitive Robot Control From Anywhere Using a Smartwatch
Figure 4 for iRoCo: Intuitive Robot Control From Anywhere Using a Smartwatch
Viaarxiv icon

Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization

Add code
Mar 05, 2024
Figure 1 for Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization
Figure 2 for Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization
Figure 3 for Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization
Figure 4 for Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization
Viaarxiv icon

Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning

Add code
Mar 04, 2024
Figure 1 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 2 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 3 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 4 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Viaarxiv icon

Crimson: Empowering Strategic Reasoning in Cybersecurity through Large Language Models

Add code
Mar 01, 2024
Figure 1 for Crimson: Empowering Strategic Reasoning in Cybersecurity through Large Language Models
Figure 2 for Crimson: Empowering Strategic Reasoning in Cybersecurity through Large Language Models
Figure 3 for Crimson: Empowering Strategic Reasoning in Cybersecurity through Large Language Models
Figure 4 for Crimson: Empowering Strategic Reasoning in Cybersecurity through Large Language Models
Viaarxiv icon

Safe Hybrid-Action Reinforcement Learning-Based Decision and Control for Discretionary Lane Change

Add code
Mar 01, 2024
Figure 1 for Safe Hybrid-Action Reinforcement Learning-Based Decision and Control for Discretionary Lane Change
Figure 2 for Safe Hybrid-Action Reinforcement Learning-Based Decision and Control for Discretionary Lane Change
Figure 3 for Safe Hybrid-Action Reinforcement Learning-Based Decision and Control for Discretionary Lane Change
Figure 4 for Safe Hybrid-Action Reinforcement Learning-Based Decision and Control for Discretionary Lane Change
Viaarxiv icon