Weizhu Chen

Language Models can be Logical Solvers

Nov 10, 2023
Jiazhan Feng, Ruochen Xu, Junheng Hao, Hiteshi Sharma, Yelong Shen, Dongyan Zhao, Weizhu Chen

Via arXiv

Learning From Mistakes Makes LLM Better Reasoner

Oct 31, 2023
Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou, Weizhu Chen

Via arXiv

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Oct 23, 2023
Yixiao Li, Yifan Yu, Chen Liang, Pengcheng He, Nikos Karampatziakis, Weizhu Chen, Tuo Zhao

Via arXiv

Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective

Oct 17, 2023
Ming Zhong, Chenxin An, Weizhu Chen, Jiawei Han, Pengcheng He

Via arXiv

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

Oct 04, 2023
Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Minlie Huang, Nan Duan, Weizhu Chen

Via arXiv

Sparse Backpropagation for MoE Training

Oct 01, 2023
Liyuan Liu, Jianfeng Gao, Weizhu Chen

Via arXiv

Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency

Sep 29, 2023
Baizhou Huang, Shuai Lu, Weizhu Chen, Xiaojun Wan, Nan Duan

Via arXiv