Rong Bao

Mitigating Reward Hacking via Information-Theoretic Reward Modeling

Feb 16, 2024
Yuchun Miao, Sen Zhang, Liang Ding, Rong Bao, Lefei Zhang, Dacheng Tao

Via arXiv

Orthogonal Subspace Learning for Language Model Continual Learning

Oct 22, 2023
Xiao Wang, Tianze Chen, Qiming Ge, Han Xia, Rong Bao, Rui Zheng, Qi Zhang, Tao Gui, Xuanjing Huang

Via arXiv

Robust Lottery Tickets for Pre-trained Language Models

Nov 06, 2022
Rui Zheng, Rong Bao, Yuhao Zhou, Di Liang, Sirui Wang, Wei Wu, Tao Gui, Qi Zhang, Xuanjing Huang

Via arXiv