Xuekai Zhu

Critical Data Size of Language Models from a Grokking Perspective

Feb 06, 2024
Xuekai Zhu, Yao Fu, Bowen Zhou, Zhouhan Lin

Via arXiv

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model

Oct 24, 2023
Kaiyan Zhang, Ning Ding, Biqing Qi, Xuekai Zhu, Xinwei Long, Bowen Zhou

Via arXiv

PaD: Program-aided Distillation Specializes Large Models in Reasoning

May 23, 2023
Xuekai Zhu, Biqing Qi, Kaiyan Zhang, Xinwei Long, Bowen Zhou

Via arXiv

StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing

Aug 29, 2022
Xuekai Zhu, Jian Guan, Minlie Huang, Juan Liu

Via arXiv