Alert button
Picture for Xinbo Zhang

Xinbo Zhang

Alert button

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Feb 08, 2024
Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang

Viaarxiv icon

ReFT: Reasoning with Reinforced Fine-Tuning

Jan 17, 2024
Trung Quoc Luong, Xinbo Zhang, Zhanming Jie, Peng Sun, Xiaoran Jin, Hang Li

Viaarxiv icon

Design of Chain-of-Thought in Math Problem Solving

Sep 30, 2023
Zhanming Jie, Trung Quoc Luong, Xinbo Zhang, Xiaoran Jin, Hang Li

Viaarxiv icon

E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning

Mar 16, 2022
Jiangjie Chen, Rui Xu, Ziquan Fu, Wei Shi, Zhongqiao Li, Xinbo Zhang, Changzhi Sun, Lei Li, Yanghua Xiao, Hao Zhou

Figure 1 for E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning
Figure 2 for E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning
Figure 3 for E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning
Figure 4 for E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning
Viaarxiv icon

Probabilistic Graph Reasoning for Natural Proof Generation

Jul 06, 2021
Changzhi Sun, Xinbo Zhang, Jiangjie Chen, Chun Gan, Yuanbin Wu, Jiaze Chen, Hao Zhou, Lei Li

Figure 1 for Probabilistic Graph Reasoning for Natural Proof Generation
Figure 2 for Probabilistic Graph Reasoning for Natural Proof Generation
Figure 3 for Probabilistic Graph Reasoning for Natural Proof Generation
Figure 4 for Probabilistic Graph Reasoning for Natural Proof Generation
Viaarxiv icon