
Shiwen Ni

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning

Mar 26, 2024
Yuelin Bai, Xinrun Du, Yiming Liang, Yonggang Jin, Ziqiang Liu, Junting Zhou, Tianyu Zheng, Xincheng Zhang, Nuo Ma, Zekun Wang, Ruibin Yuan, Haihong Wu, Hongquan Lin, Wenhao Huang, Jiajun Zhang, Wenhu Chen, Chenghua Lin, Jie Fu, Min Yang, Shiwen Ni, Ge Zhang

MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property

Feb 26, 2024
Shiwen Ni, Minghuan Tan, Yuelin Bai, Fuqiang Niu, Min Yang, Bowen Zhang, Ruifeng Xu, Xiaojun Chen, Chengming Li, Xiping Hu, Ye Li, Jianping Fan

Layer-wise Regularized Dropout for Neural Language Models

Feb 26, 2024
Shiwen Ni, Min Yang, Ruifeng Xu, Chengming Li, Xiping Hu

History, Development, and Principles of Large Language Models-An Introductory Survey

Feb 10, 2024
Zhibo Chu, Shiwen Ni, Zichong Wang, Xi Feng, Chengming Li, Xiping Hu, Ruifeng Xu, Min Yang, Wenbin Zhang

E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models

Jan 29, 2024
Jinchang Hou, Chang Ao, Haihong Wu, Xiangtao Kong, Zhigang Zheng, Daijia Tang, Chengming Li, Xiping Hu, Ruifeng Xu, Shiwen Ni, Min Yang

Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models

Nov 14, 2023
Shiwen Ni, Dingwei Chen, Chengming Li, Xiping Hu, Ruifeng Xu, Min Yang

ELECTRA is a Zero-Shot Learner, Too

Jul 20, 2022
Shiwen Ni, Hung-Yu Kao

True or False: Does the Deep Learning Model Learn to Detect Rumors?

Dec 01, 2021
Shiwen Ni, Jiawen Li, Hung-Yu Kao

DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks

Aug 29, 2021
Shiwen Ni, Jiawen Li, Hung-Yu Kao
