Alert button
Picture for Weizhu Chen

Weizhu Chen

Alert button

CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing

Add code
Bookmark button
Alert button
Apr 13, 2022
Chen Liang, Pengcheng He, Yelong Shen, Weizhu Chen, Tuo Zhao

Figure 1 for CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing
Figure 2 for CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing
Figure 3 for CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing
Figure 4 for CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing
Viaarxiv icon

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

Add code
Bookmark button
Alert button
Mar 28, 2022
Greg Yang, Edward J. Hu, Igor Babuschkin, Szymon Sidor, Xiaodong Liu, David Farhi, Nick Ryder, Jakub Pachocki, Weizhu Chen, Jianfeng Gao

Figure 1 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 2 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 3 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 4 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Viaarxiv icon

Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models

Add code
Bookmark button
Alert button
Mar 07, 2022
Shengnan An, Yifei Li, Zeqi Lin, Qian Liu, Bei Chen, Qiang Fu, Weizhu Chen, Nanning Zheng, Jian-Guang Lou

Figure 1 for Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Figure 2 for Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Figure 3 for Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Figure 4 for Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Viaarxiv icon

Controllable Natural Language Generation with Contrastive Prefixes

Add code
Bookmark button
Alert button
Feb 27, 2022
Jing Qian, Li Dong, Yelong Shen, Furu Wei, Weizhu Chen

Figure 1 for Controllable Natural Language Generation with Contrastive Prefixes
Figure 2 for Controllable Natural Language Generation with Contrastive Prefixes
Figure 3 for Controllable Natural Language Generation with Contrastive Prefixes
Figure 4 for Controllable Natural Language Generation with Contrastive Prefixes
Viaarxiv icon

Truncated Diffusion Probabilistic Models

Add code
Bookmark button
Alert button
Feb 19, 2022
Huangjie Zheng, Pengcheng He, Weizhu Chen, Mingyuan Zhou

Figure 1 for Truncated Diffusion Probabilistic Models
Figure 2 for Truncated Diffusion Probabilistic Models
Figure 3 for Truncated Diffusion Probabilistic Models
Figure 4 for Truncated Diffusion Probabilistic Models
Viaarxiv icon

No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models

Add code
Bookmark button
Alert button
Feb 14, 2022
Chen Liang, Haoming Jiang, Simiao Zuo, Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen, Tuo Zhao

Figure 1 for No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
Figure 2 for No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
Figure 3 for No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
Figure 4 for No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
Viaarxiv icon

Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs

Add code
Bookmark button
Alert button
Feb 14, 2022
Huangjie Zheng, Pengcheng He, Weizhu Chen, Mingyuan Zhou

Figure 1 for Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs
Figure 2 for Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs
Figure 3 for Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs
Figure 4 for Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs
Viaarxiv icon

Reasoning Like Program Executors

Add code
Bookmark button
Alert button
Jan 27, 2022
Xinyu Pi, Qian Liu, Bei Chen, Morteza Ziyadi, Zeqi Lin, Yan Gao, Qiang Fu, Jian-Guang Lou, Weizhu Chen

Figure 1 for Reasoning Like Program Executors
Figure 2 for Reasoning Like Program Executors
Figure 3 for Reasoning Like Program Executors
Figure 4 for Reasoning Like Program Executors
Viaarxiv icon