Alert button
Picture for Qing-Shan Jia

Qing-Shan Jia

Alert button

Query-Policy Misalignment in Preference-Based Reinforcement Learning

Add code
Bookmark button
Alert button
May 27, 2023
Xiao Hu, Jianxiong Li, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang

Figure 1 for Query-Policy Misalignment in Preference-Based Reinforcement Learning
Figure 2 for Query-Policy Misalignment in Preference-Based Reinforcement Learning
Figure 3 for Query-Policy Misalignment in Preference-Based Reinforcement Learning
Figure 4 for Query-Policy Misalignment in Preference-Based Reinforcement Learning
Viaarxiv icon

Mind the Gap: Offline Policy Optimization for Imperfect Rewards

Add code
Bookmark button
Alert button
Feb 03, 2023
Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang

Figure 1 for Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Figure 2 for Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Figure 3 for Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Figure 4 for Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Viaarxiv icon

Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method

Add code
Bookmark button
Alert button
Oct 31, 2021
Kuo Li, Qing-Shan Jia

Figure 1 for Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method
Figure 2 for Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method
Viaarxiv icon

An Actor-Critic Method for Simulation-Based Optimization

Add code
Bookmark button
Alert button
Oct 31, 2021
Kuo Li, Qing-Shan Jia, Jiaqi Yan

Figure 1 for An Actor-Critic Method for Simulation-Based Optimization
Figure 2 for An Actor-Critic Method for Simulation-Based Optimization
Figure 3 for An Actor-Critic Method for Simulation-Based Optimization
Figure 4 for An Actor-Critic Method for Simulation-Based Optimization
Viaarxiv icon