Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Self-Evolution Learning for Discriminative Language Model Pretraining


May 24, 2023
Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao

Add code

* Accepted to Findings of ACL2023 

   Access Paper or Ask Questions

Revisiting Token Dropping Strategy in Efficient BERT Pretraining


May 24, 2023
Qihuang Zhong, Liang Ding, Juhua Liu, Xuebo Liu, Min Zhang, Bo Du, Dacheng Tao

Add code

* Accepted to ACL2023 Main Conference 

   Access Paper or Ask Questions

Self-Evolution Learning for Mixup: Enhance Data Augmentation on Few-Shot Text Classification Tasks


May 22, 2023
Haoqi Zheng, Qihuang Zhong, Liang Ding, Zhiliang Tian, Xin Niu, Dongsheng Li, Dacheng Tao

Add code


   Access Paper or Ask Questions

Towards Making the Most of ChatGPT for Machine Translation


Mar 24, 2023
Keqin Peng, Liang Ding, Qihuang Zhong, Li Shen, Xuebo Liu, Min Zhang, Yuanxin Ouyang, Dacheng Tao

Add code

* Work in progress, 9 pages 

   Access Paper or Ask Questions

Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT


Mar 02, 2023
Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao

Add code

* Work in progress. Added results of advanced prompting strategies, e.g., CoT. (19 pages) 

   Access Paper or Ask Questions

AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural Networks


Mar 01, 2023
Hao Sun, Li Shen, Qihuang Zhong, Liang Ding, Shixiang Chen, Jingwei Sun, Jing Li, Guangzhong Sun, Dacheng Tao

Add code

* 18 pages 

   Access Paper or Ask Questions

Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE


Feb 18, 2023
Qihuang Zhong, Liang Ding, Keqin Peng, Juhua Liu, Bo Du, Li Shen, Yibing Zhan, Dacheng Tao

Add code

* Technical report. arXiv admin note: text overlap with arXiv:2212.01853 

   Access Paper or Ask Questions

Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE


Dec 04, 2022
Qihuang Zhong, Liang Ding, Yibing Zhan, Yu Qiao, Yonggang Wen, Li Shen, Juhua Liu, Baosheng Yu, Bo Du, Yixin Chen, Xinbo Gao, Chunyan Miao, Xiaoou Tang, Dacheng Tao

Add code

* Technical report 

   Access Paper or Ask Questions

Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models


Oct 11, 2022
Qihuang Zhong, Liang Ding, Li Shen, Peng Mi, Juhua Liu, Bo Du, Dacheng Tao

Add code

* Accepted by EMNLP 2022 (Findings) 

   Access Paper or Ask Questions

1
2
>>