Senjie Jin

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Feb 08, 2024
Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang

MouSi: Poly-Visual-Expert Vision-Language Models

Jan 30, 2024
Xiaoran Fan, Tao Ji, Changhao Jiang, Shuo Li, Senjie Jin, Sirui Song, Junke Wang, Boyang Hong, Lu Chen, Guodong Zheng, Ming Zhang, Caishuang Huang, Rui Zheng, Zhiheng Xi, Yuhao Zhou, Shihan Dou, Junjie Ye, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Jan 12, 2024
Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang

TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models

Oct 10, 2023
Xiao Wang, Yuansen Zhang, Tianze Chen, Songyang Gao, Senjie Jin, Xianjun Yang, Zhiheng Xi, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xuanjing Huang

The Rise and Potential of Large Language Model Based Agents: A Survey

Sep 19, 2023
Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui

Secrets of RLHF in Large Language Models Part I: PPO

Jul 18, 2023
Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement

May 23, 2023
Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Tao Gui, Qi Zhang, Xuanjing Huang
