Alert button
Picture for Weizhu Chen

Weizhu Chen

Alert button

Rho-1: Not All Tokens Are What You Need

Add code
Bookmark button
Alert button
Apr 11, 2024
Zhenghao Lin, Zhibin Gou, Yeyun Gong, Xiao Liu, Yelong Shen, Ruochen Xu, Chen Lin, Yujiu Yang, Jian Jiao, Nan Duan, Weizhu Chen

Viaarxiv icon

A Note on LoRA

Add code
Bookmark button
Alert button
Apr 07, 2024
Vlad Fomenko, Han Yu, Jongho Lee, Stanley Hsieh, Weizhu Chen

Viaarxiv icon

Exploring the Mystery of Influential Data for Mathematical Reasoning

Add code
Bookmark button
Alert button
Apr 01, 2024
Xinzhe Ni, Yeyun Gong, Zhibin Gou, Yelong Shen, Yujiu Yang, Nan Duan, Weizhu Chen

Viaarxiv icon

Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning

Add code
Bookmark button
Alert button
Mar 04, 2024
Yiming Huang, Xiao Liu, Yeyun Gong, Zhibin Gou, Yelong Shen, Nan Duan, Weizhu Chen

Figure 1 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 2 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 3 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 4 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Viaarxiv icon

Multi-LoRA Composition for Image Generation

Add code
Bookmark button
Alert button
Feb 26, 2024
Ming Zhong, Yelong Shen, Shuohang Wang, Yadong Lu, Yizhu Jiao, Siru Ouyang, Donghan Yu, Jiawei Han, Weizhu Chen

Viaarxiv icon

SciAgent: Tool-augmented Language Models for Scientific Reasoning

Add code
Bookmark button
Alert button
Feb 21, 2024
Yubo Ma, Zhibin Gou, Junheng Hao, Ruochen Xu, Shuohang Wang, Liangming Pan, Yujiu Yang, Yixin Cao, Aixin Sun, Hany Awadalla, Weizhu Chen

Viaarxiv icon

Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts

Add code
Bookmark button
Alert button
Feb 12, 2024
Yueqin Yin, Zhendong Wang, Yi Gu, Hai Huang, Weizhu Chen, Mingyuan Zhou

Viaarxiv icon

Supervised Knowledge Makes Large Language Models Better In-context Learners

Add code
Bookmark button
Alert button
Dec 26, 2023
Linyi Yang, Shuibai Zhang, Zhuohao Yu, Guangsheng Bao, Yidong Wang, Jindong Wang, Ruochen Xu, Wei Ye, Xing Xie, Weizhu Chen, Yue Zhang

Viaarxiv icon

Competition-Level Problems are Effective LLM Evaluators

Add code
Bookmark button
Alert button
Dec 05, 2023
Yiming Huang, Zhenghao Lin, Xiao Liu, Yeyun Gong, Shuai Lu, Fangyu Lei, Yaobo Liang, Yelong Shen, Chen Lin, Nan Duan, Weizhu Chen

Figure 1 for Competition-Level Problems are Effective LLM Evaluators
Figure 2 for Competition-Level Problems are Effective LLM Evaluators
Figure 3 for Competition-Level Problems are Effective LLM Evaluators
Figure 4 for Competition-Level Problems are Effective LLM Evaluators
Viaarxiv icon

Learning From Mistakes Makes LLM Better Reasoner

Add code
Bookmark button
Alert button
Nov 14, 2023
Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou, Weizhu Chen

Figure 1 for Learning From Mistakes Makes LLM Better Reasoner
Figure 2 for Learning From Mistakes Makes LLM Better Reasoner
Figure 3 for Learning From Mistakes Makes LLM Better Reasoner
Figure 4 for Learning From Mistakes Makes LLM Better Reasoner
Viaarxiv icon