Alert button
Picture for Xiaowei Du

Xiaowei Du

Alert button

DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models

Add code
Bookmark button
Alert button
Oct 23, 2023
Chengcheng Han, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li, Ming Gao, Baoyuan Wang

Figure 1 for DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
Figure 2 for DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
Figure 3 for DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
Figure 4 for DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
Viaarxiv icon