Alert button
Picture for Baobao Chang

Baobao Chang

Alert button

StableMoE: Stable Routing Strategy for Mixture of Experts

Add code
Bookmark button
Alert button
Apr 18, 2022
Damai Dai, Li Dong, Shuming Ma, Bo Zheng, Zhifang Sui, Baobao Chang, Furu Wei

Figure 1 for StableMoE: Stable Routing Strategy for Mixture of Experts
Figure 2 for StableMoE: Stable Routing Strategy for Mixture of Experts
Figure 3 for StableMoE: Stable Routing Strategy for Mixture of Experts
Figure 4 for StableMoE: Stable Routing Strategy for Mixture of Experts
Viaarxiv icon

Mixture of Experts for Biomedical Question Answering

Add code
Bookmark button
Alert button
Apr 15, 2022
Damai Dai, Wenbin Jiang, Jiyuan Zhang, Weihua Peng, Yajuan Lyu, Zhifang Sui, Baobao Chang, Yong Zhu

Figure 1 for Mixture of Experts for Biomedical Question Answering
Figure 2 for Mixture of Experts for Biomedical Question Answering
Figure 3 for Mixture of Experts for Biomedical Question Answering
Figure 4 for Mixture of Experts for Biomedical Question Answering
Viaarxiv icon

DISK: Domain-constrained Instance Sketch for Math Word Problem Generation

Add code
Bookmark button
Alert button
Apr 10, 2022
Tianyang Cao, Shuang Zeng, Xiaodan Xu, Mairgup Mansur, Baobao Chang

Figure 1 for DISK: Domain-constrained Instance Sketch for Math Word Problem Generation
Figure 2 for DISK: Domain-constrained Instance Sketch for Math Word Problem Generation
Figure 3 for DISK: Domain-constrained Instance Sketch for Math Word Problem Generation
Figure 4 for DISK: Domain-constrained Instance Sketch for Math Word Problem Generation
Viaarxiv icon

A Roadmap for Big Model

Add code
Bookmark button
Alert button
Apr 02, 2022
Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, Lingxiao Huang, Zheng Liang, Huawei Shen, Hui Zhang, Quanshi Zhang, Qingxiu Dong, Zhixing Tan, Mingxuan Wang, Shuo Wang, Long Zhou, Haoran Li, Junwei Bao, Yingwei Pan, Weinan Zhang, Zhou Yu, Rui Yan, Chence Shi, Minghao Xu, Zuobai Zhang, Guoqiang Wang, Xiang Pan, Mengjie Li, Xiaoyu Chu, Zijun Yao, Fangwei Zhu, Shulin Cao, Weicheng Xue, Zixuan Ma, Zhengyan Zhang, Shengding Hu, Yujia Qin, Chaojun Xiao, Zheni Zeng, Ganqu Cui, Weize Chen, Weilin Zhao, Yuan Yao, Peng Li, Wenzhao Zheng, Wenliang Zhao, Ziyi Wang, Borui Zhang, Nanyi Fei, Anwen Hu, Zenan Ling, Haoyang Li, Boxi Cao, Xianpei Han, Weidong Zhan, Baobao Chang, Hao Sun, Jiawen Deng, Chujie Zheng, Juanzi Li, Lei Hou, Xigang Cao, Jidong Zhai, Zhiyuan Liu, Maosong Sun, Jiwen Lu, Zhiwu Lu, Qin Jin, Ruihua Song, Ji-Rong Wen, Zhouchen Lin, Liwei Wang, Hang Su, Jun Zhu, Zhifang Sui, Jiajun Zhang, Yang Liu, Xiaodong He, Minlie Huang, Jian Tang, Jie Tang

Figure 1 for A Roadmap for Big Model
Figure 2 for A Roadmap for Big Model
Figure 3 for A Roadmap for Big Model
Figure 4 for A Roadmap for Big Model
Viaarxiv icon

Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation

Add code
Bookmark button
Alert button
Mar 11, 2022
Liang Chen, Runxin Xu, Baobao Chang

Figure 1 for Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation
Figure 2 for Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation
Figure 3 for Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation
Figure 4 for Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation
Viaarxiv icon

From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression

Add code
Bookmark button
Alert button
Dec 14, 2021
Runxin Xu, Fuli Luo, Chengyu Wang, Baobao Chang, Jun Huang, Songfang Huang, Fei Huang

Figure 1 for From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression
Figure 2 for From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression
Figure 3 for From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression
Figure 4 for From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression
Viaarxiv icon

Hierarchical Curriculum Learning for AMR Parsing

Add code
Bookmark button
Alert button
Oct 15, 2021
Peiyi Wang, Liang Chen, Tianyu Liu, Baobao Chang, Zhifang Sui

Figure 1 for Hierarchical Curriculum Learning for AMR Parsing
Figure 2 for Hierarchical Curriculum Learning for AMR Parsing
Figure 3 for Hierarchical Curriculum Learning for AMR Parsing
Figure 4 for Hierarchical Curriculum Learning for AMR Parsing
Viaarxiv icon

An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling

Add code
Bookmark button
Alert button
Sep 27, 2021
Peiyi Wang, Runxin Xu, Tianyu Liu, Qingyu Zhou, Yunbo Cao, Baobao Chang, Zhifang Sui

Figure 1 for An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling
Figure 2 for An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling
Figure 3 for An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling
Figure 4 for An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling
Viaarxiv icon

Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning

Add code
Bookmark button
Alert button
Sep 13, 2021
Runxin Xu, Fuli Luo, Zhiyuan Zhang, Chuanqi Tan, Baobao Chang, Songfang Huang, Fei Huang

Figure 1 for Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning
Figure 2 for Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning
Figure 3 for Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning
Figure 4 for Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning
Viaarxiv icon