Zhihong Shao

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Feb 06, 2024
Zhihong Shao, Peiyi Wang, Qihao Zhu, Runxin Xu, Junxiao Song, Mingchuan Zhang, Y. K. Li, Y. Wu, Daya Guo

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Jan 05, 2024
DeepSeek-AI: Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, Guowei Li, Jiashi Li, Yao Li, Y. K. Li, Wenfeng Liang, Fangyun Lin, A. X. Liu, Bo Liu, Wen Liu, Xiaodong Liu, Xin Liu, Yiyuan Liu, Haoyu Lu, Shanghao Lu, Fuli Luo, Shirong Ma, Xiaotao Nie, Tian Pei, Yishi Piao, Junjie Qiu, Hui Qu, Tongzheng Ren, Zehui Ren, Chong Ruan, Zhangli Sha, Zhihong Shao, Junxiao Song, Xuecheng Su, Jingxiang Sun, Yaofeng Sun, Minghui Tang, Bingxuan Wang, Peiyi Wang, Shiyu Wang, Yaohui Wang, Yongji Wang, Tong Wu, Y. Wu, Xin Xie, Zhenda Xie, Ziwei Xie, Yiliang Xiong, Hanwei Xu, R. X. Xu, Yanhong Xu, Dejian Yang, Yuxiang You, Shuiping Yu, Xingkai Yu, B. Zhang, Haowei Zhang, Lecong Zhang, Liyue Zhang, Mingchuan Zhang, Minghua Zhang, Wentao Zhang, Yichao Zhang, Chenggang Zhao, Yao Zhao, Shangyan Zhou, Shunfeng Zhou, Qihao Zhu, Yuheng Zou

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

Dec 28, 2023
Peiyi Wang, Lei Li, Zhihong Shao, R. X. Xu, Damai Dai, Yifei Li, Deli Chen, Y. Wu, Zhifang Sui

Math-Shepherd: A Label-Free Step-by-Step Verifier for LLMs in Mathematical Reasoning

Dec 14, 2023
Peiyi Wang, Lei Li, Zhihong Shao, R. X. Xu, Damai Dai, Yifei Li, Deli Chen, Y. Wu, Zhifang Sui

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

Oct 04, 2023
Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Minlie Huang, Nan Duan, Weizhu Chen

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

May 24, 2023
Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

May 19, 2023
Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Nan Duan, Weizhu Chen

Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models

Feb 01, 2023
Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen
