Alert button
Picture for Deli Chen

Deli Chen

Alert button

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Add code
Bookmark button
Alert button
Jan 11, 2024
Damai Dai, Chengqi Deng, Chenggang Zhao, R. X. Xu, Huazuo Gao, Deli Chen, Jiashi Li, Wangding Zeng, Xingkai Yu, Y. Wu, Zhenda Xie, Y. K. Li, Panpan Huang, Fuli Luo, Chong Ruan, Zhifang Sui, Wenfeng Liang

Viaarxiv icon

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Add code
Bookmark button
Alert button
Jan 05, 2024
DeepSeek-AI, :, Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, Guowei Li, Jiashi Li, Yao Li, Y. K. Li, Wenfeng Liang, Fangyun Lin, A. X. Liu, Bo Liu, Wen Liu, Xiaodong Liu, Xin Liu, Yiyuan Liu, Haoyu Lu, Shanghao Lu, Fuli Luo, Shirong Ma, Xiaotao Nie, Tian Pei, Yishi Piao, Junjie Qiu, Hui Qu, Tongzheng Ren, Zehui Ren, Chong Ruan, Zhangli Sha, Zhihong Shao, Junxiao Song, Xuecheng Su, Jingxiang Sun, Yaofeng Sun, Minghui Tang, Bingxuan Wang, Peiyi Wang, Shiyu Wang, Yaohui Wang, Yongji Wang, Tong Wu, Y. Wu, Xin Xie, Zhenda Xie, Ziwei Xie, Yiliang Xiong, Hanwei Xu, R. X. Xu, Yanhong Xu, Dejian Yang, Yuxiang You, Shuiping Yu, Xingkai Yu, B. Zhang, Haowei Zhang, Lecong Zhang, Liyue Zhang, Mingchuan Zhang, Minghua Zhang, Wentao Zhang, Yichao Zhang, Chenggang Zhao, Yao Zhao, Shangyan Zhou, Shunfeng Zhou, Qihao Zhu, Yuheng Zou

Viaarxiv icon

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

Add code
Bookmark button
Alert button
Dec 28, 2023
Peiyi Wang, Lei Li, Zhihong Shao, R. X. Xu, Damai Dai, Yifei Li, Deli Chen, Y. Wu, Zhifang Sui

Viaarxiv icon

Math-Shepherd: A Label-Free Step-by-Step Verifier for LLMs in Mathematical Reasoning

Add code
Bookmark button
Alert button
Dec 14, 2023
Peiyi Wang, Lei Li, Zhihong Shao, R. X. Xu, Damai Dai, Yifei Li, Deli Chen, Y. Wu, Zhifang Sui

Viaarxiv icon

Towards Codable Text Watermarking for Large Language Models

Add code
Bookmark button
Alert button
Jul 29, 2023
Lean Wang, Wenkai Yang, Deli Chen, Hao Zhou, Yankai Lin, Fandong Meng, Jie Zhou, Xu Sun

Figure 1 for Towards Codable Text Watermarking for Large Language Models
Figure 2 for Towards Codable Text Watermarking for Large Language Models
Figure 3 for Towards Codable Text Watermarking for Large Language Models
Figure 4 for Towards Codable Text Watermarking for Large Language Models
Viaarxiv icon

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Add code
Bookmark button
Alert button
May 23, 2023
Lean Wang, Lei Li, Damai Dai, Deli Chen, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun

Figure 1 for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Figure 2 for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Figure 3 for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Figure 4 for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Viaarxiv icon

Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias

Add code
Bookmark button
Alert button
May 08, 2023
Zhiyuan Zhang, Deli Chen, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun

Figure 1 for Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias
Figure 2 for Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias
Figure 3 for Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias
Figure 4 for Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias
Viaarxiv icon

Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning

Add code
Bookmark button
Alert button
Jan 26, 2023
Wenkai Yang, Deli Chen, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun

Figure 1 for Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning
Figure 2 for Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning
Figure 3 for Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning
Figure 4 for Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning
Viaarxiv icon

Topology-Imbalance Learning for Semi-Supervised Node Classification

Add code
Bookmark button
Alert button
Oct 08, 2021
Deli Chen, Yankai Lin, Guangxiang Zhao, Xuancheng Ren, Peng Li, Jie Zhou, Xu Sun

Figure 1 for Topology-Imbalance Learning for Semi-Supervised Node Classification
Figure 2 for Topology-Imbalance Learning for Semi-Supervised Node Classification
Figure 3 for Topology-Imbalance Learning for Semi-Supervised Node Classification
Figure 4 for Topology-Imbalance Learning for Semi-Supervised Node Classification
Viaarxiv icon

Accelerating Pre-trained Language Models via Calibrated Cascade

Add code
Bookmark button
Alert button
Dec 29, 2020
Lei Li, Yankai Lin, Shuhuai Ren, Deli Chen, Xuancheng Ren, Peng Li, Jie Zhou, Xu Sun

Figure 1 for Accelerating Pre-trained Language Models via Calibrated Cascade
Figure 2 for Accelerating Pre-trained Language Models via Calibrated Cascade
Figure 3 for Accelerating Pre-trained Language Models via Calibrated Cascade
Viaarxiv icon