Wei Fu

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Apr 16, 2024
Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu


Learning Agile Bipedal Motions on a Quadrupedal Robot

Nov 10, 2023
Yunfei Li, Jinhan Li, Wei Fu, Yi Wu


Iteratively Learn Diverse Strategies with State Distance Information

Oct 23, 2023
Wei Fu, Weihua Du, Jingwei Li, Sunli Chen, Jingzhao Zhang, Yi Wu


SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

Jul 05, 2023
Zhiyu Mei, Wei Fu, Guangju Wang, Huanchen Zhang, Yi Wu


Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning

Jun 15, 2022
Wei Fu, Chao Yu, Zelai Xu, Jiaqi Yang, Yi Wu


Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization

Apr 04, 2022
Zihan Zhou, Wei Fu, Bingliang Zhang, Yi Wu


How to "DODGE" Complex Software Analytics?

Feb 05, 2019
Amritanshu Agrawal, Wei Fu, Di Chen, Xipeng Shen, Tim Menzies


What is Wrong with Topic Modeling? (and How to Fix it Using Search-based Software Engineering)

Feb 20, 2018
Amritanshu Agrawal, Wei Fu, Tim Menzies


500+ Times Faster Than Deep Learning (A Case Study Exploring Faster Methods for Text Mining StackOverflow)

Feb 14, 2018
Suvodeep Majumder, Nikhila Balaji, Katie Brey, Wei Fu, Tim Menzies
