DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models

Oct 10, 2023
Chengcheng Han, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li, Ming Gao, Baoyuan Wang
