Fengshuo Bai

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects

Mar 01, 2024
Zhaowei Zhang, Fengshuo Bai, Mingzhi Wang, Haoyang Ye, Chengdong Ma, Yaodong Yang

Measuring Value Understanding in Language Models through Discriminator-Critique Gap

Oct 19, 2023
Zhaowei Zhang, Fengshuo Bai, Jun Gao, Yaodong Yang

Zero-shot Preference Learning for Offline RL via Optimal Transport

Jun 06, 2023
Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li
