Alert button
Picture for Ziang Song

Ziang Song

Alert button

Reward Collapse in Aligning Large Language Models

Add code
Bookmark button
Alert button
May 28, 2023
Ziang Song, Tianle Cai, Jason D. Lee, Weijie J. Su

Figure 1 for Reward Collapse in Aligning Large Language Models
Figure 2 for Reward Collapse in Aligning Large Language Models
Figure 3 for Reward Collapse in Aligning Large Language Models
Figure 4 for Reward Collapse in Aligning Large Language Models
Viaarxiv icon

Efficient $Φ$-Regret Minimization in Extensive-Form Games via Online Mirror Descent

Add code
Bookmark button
Alert button
Jun 02, 2022
Yu Bai, Chi Jin, Song Mei, Ziang Song, Tiancheng Yu

Viaarxiv icon

Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games

Add code
Bookmark button
Alert button
May 15, 2022
Ziang Song, Song Mei, Yu Bai

Viaarxiv icon

Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation

Add code
Bookmark button
Alert button
May 03, 2022
Yukun Feng, Feng Li, Ziang Song, Boyuan Zheng, Philipp Koehn

Figure 1 for Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation
Figure 2 for Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation
Figure 3 for Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation
Figure 4 for Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation
Viaarxiv icon

When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?

Add code
Bookmark button
Alert button
Oct 08, 2021
Ziang Song, Song Mei, Yu Bai

Figure 1 for When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?
Viaarxiv icon