Xinbo Zhang

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Feb 08, 2024

ReFT: Reasoning with Reinforced Fine-Tuning

Jan 17, 2024

Design of Chain-of-Thought in Math Problem Solving

Sep 30, 2023

E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning

Mar 16, 2022

Probabilistic Graph Reasoning for Natural Proof Generation

Jul 06, 2021