Alert button
Picture for Guangju Wang

Guangju Wang

Alert button

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Add code
Bookmark button
Alert button
Apr 16, 2024
Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu

Viaarxiv icon

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

Add code
Bookmark button
Alert button
Jul 05, 2023
Zhiyu Mei, Wei Fu, Guangju Wang, Huanchen Zhang, Yi Wu

Figure 1 for SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Figure 2 for SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Figure 3 for SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Figure 4 for SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Viaarxiv icon