Alert button
Picture for Damai Dai

Damai Dai

Alert button

Large Language Models Are Unconscious of Unreasonability in Math Problems

Add code
Bookmark button
Alert button
Mar 28, 2024
Jingyuan Ma, Damai Dai, Zhifang Sui

Viaarxiv icon

PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization

Add code
Bookmark button
Alert button
Feb 25, 2024
Xiangdi Meng, Damai Dai, Weiyao Luo, Zhe Yang, Shaoxiang Wu, Xiaochen Wang, Peiyi Wang, Qingxiu Dong, Liang Chen, Zhifang Sui

Viaarxiv icon

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Add code
Bookmark button
Alert button
Jan 11, 2024
Damai Dai, Chengqi Deng, Chenggang Zhao, R. X. Xu, Huazuo Gao, Deli Chen, Jiashi Li, Wangding Zeng, Xingkai Yu, Y. Wu, Zhenda Xie, Y. K. Li, Panpan Huang, Fuli Luo, Chong Ruan, Zhifang Sui, Wenfeng Liang

Viaarxiv icon

Language Models Understand Numbers, at Least Partially

Add code
Bookmark button
Alert button
Jan 08, 2024
Fangwei Zhu, Damai Dai, Zhifang Sui

Viaarxiv icon

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Add code
Bookmark button
Alert button
Jan 05, 2024
DeepSeek-AI, :, Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, Guowei Li, Jiashi Li, Yao Li, Y. K. Li, Wenfeng Liang, Fangyun Lin, A. X. Liu, Bo Liu, Wen Liu, Xiaodong Liu, Xin Liu, Yiyuan Liu, Haoyu Lu, Shanghao Lu, Fuli Luo, Shirong Ma, Xiaotao Nie, Tian Pei, Yishi Piao, Junjie Qiu, Hui Qu, Tongzheng Ren, Zehui Ren, Chong Ruan, Zhangli Sha, Zhihong Shao, Junxiao Song, Xuecheng Su, Jingxiang Sun, Yaofeng Sun, Minghui Tang, Bingxuan Wang, Peiyi Wang, Shiyu Wang, Yaohui Wang, Yongji Wang, Tong Wu, Y. Wu, Xin Xie, Zhenda Xie, Ziwei Xie, Yiliang Xiong, Hanwei Xu, R. X. Xu, Yanhong Xu, Dejian Yang, Yuxiang You, Shuiping Yu, Xingkai Yu, B. Zhang, Haowei Zhang, Lecong Zhang, Liyue Zhang, Mingchuan Zhang, Minghua Zhang, Wentao Zhang, Yichao Zhang, Chenggang Zhao, Yao Zhao, Shangyan Zhou, Shunfeng Zhou, Qihao Zhu, Yuheng Zou

Viaarxiv icon

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

Add code
Bookmark button
Alert button
Dec 28, 2023
Peiyi Wang, Lei Li, Zhihong Shao, R. X. Xu, Damai Dai, Yifei Li, Deli Chen, Y. Wu, Zhifang Sui

Viaarxiv icon

Math-Shepherd: A Label-Free Step-by-Step Verifier for LLMs in Mathematical Reasoning

Add code
Bookmark button
Alert button
Dec 14, 2023
Peiyi Wang, Lei Li, Zhihong Shao, R. X. Xu, Damai Dai, Yifei Li, Deli Chen, Y. Wu, Zhifang Sui

Viaarxiv icon

Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning

Add code
Bookmark button
Alert button
Oct 12, 2023
Zhe Yang, Damai Dai, Peiyi Wang, Zhifang Sui

Figure 1 for Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning
Figure 2 for Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning
Figure 3 for Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning
Figure 4 for Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning
Viaarxiv icon

Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion

Add code
Bookmark button
Alert button
May 25, 2023
Shaoxaing Wu, Damai Dai, Ziwei Qin, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui

Figure 1 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Figure 2 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Figure 3 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Figure 4 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Viaarxiv icon

Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization

Add code
Bookmark button
Alert button
May 24, 2023
Shoujie Tong, Heming Xia, Damai Dai, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui

Figure 1 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Figure 2 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Figure 3 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Figure 4 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Viaarxiv icon