Alert button
Picture for Ping-Chun Hsieh

Ping-Chun Hsieh

Alert button

Hinge Policy Optimization: Rethinking Policy Improvement and Reinterpreting PPO

Add code
Bookmark button
Alert button
Oct 26, 2021
Hsuan-Yu Yao, Ping-Chun Hsieh, Kuo-Hao Ho, Kai-Chun Hu, Liang-Chun Ouyang, I-Chen Wu

Figure 1 for Hinge Policy Optimization: Rethinking Policy Improvement and Reinterpreting PPO
Figure 2 for Hinge Policy Optimization: Rethinking Policy Improvement and Reinterpreting PPO
Figure 3 for Hinge Policy Optimization: Rethinking Policy Improvement and Reinterpreting PPO
Viaarxiv icon

NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL

Add code
Bookmark button
Alert button
Oct 05, 2021
Khaled Nakhleh, Santosh Ganji, Ping-Chun Hsieh, I-Hong Hou, Srinivas Shakkottai

Figure 1 for NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Figure 2 for NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Figure 3 for NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Figure 4 for NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Viaarxiv icon

Reinforced Few-Shot Acquisition Function Learning for Bayesian Optimization

Add code
Bookmark button
Alert button
Jun 08, 2021
Bing-Jing Hsieh, Ping-Chun Hsieh, Xi Liu

Figure 1 for Reinforced Few-Shot Acquisition Function Learning for Bayesian Optimization
Figure 2 for Reinforced Few-Shot Acquisition Function Learning for Bayesian Optimization
Figure 3 for Reinforced Few-Shot Acquisition Function Learning for Bayesian Optimization
Figure 4 for Reinforced Few-Shot Acquisition Function Learning for Bayesian Optimization
Viaarxiv icon

Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization

Add code
Bookmark button
Alert button
Feb 22, 2021
Jyun-Li Lin, Wei Hung, Shang-Hsuan Yang, Ping-Chun Hsieh, Xi Liu

Figure 1 for Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Figure 2 for Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Figure 3 for Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Figure 4 for Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Viaarxiv icon

Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits

Add code
Bookmark button
Alert button
Oct 08, 2020
Yu-Heng Hung, Ping-Chun Hsieh, Xi Liu, P. R. Kumar

Figure 1 for Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits
Figure 2 for Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits
Figure 3 for Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits
Figure 4 for Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits
Viaarxiv icon

Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 27, 2020
Xi Liu, Li Li, Ping-Chun Hsieh, Muhe Xie, Yong Ge, Rui Chen

Figure 1 for Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning
Figure 2 for Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning
Figure 3 for Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning
Figure 4 for Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning
Viaarxiv icon

Bandit Learning Through Biased Maximum Likelihood Estimation

Add code
Bookmark button
Alert button
Jul 23, 2019
Xi Liu, Ping-Chun Hsieh, Anirban Bhattacharya, P. R. Kumar

Figure 1 for Bandit Learning Through Biased Maximum Likelihood Estimation
Figure 2 for Bandit Learning Through Biased Maximum Likelihood Estimation
Figure 3 for Bandit Learning Through Biased Maximum Likelihood Estimation
Figure 4 for Bandit Learning Through Biased Maximum Likelihood Estimation
Viaarxiv icon

Streaming Network Embedding through Local Actions

Add code
Bookmark button
Alert button
Nov 14, 2018
Xi Liu, Ping-Chun Hsieh, Nick Duffield, Rui Chen, Muhe Xie, Xidao Wen

Figure 1 for Streaming Network Embedding through Local Actions
Figure 2 for Streaming Network Embedding through Local Actions
Figure 3 for Streaming Network Embedding through Local Actions
Figure 4 for Streaming Network Embedding through Local Actions
Viaarxiv icon

Heteroscedastic Bandits with Reneging

Add code
Bookmark button
Alert button
Oct 31, 2018
Ping-Chun Hsieh, Xi Liu, Anirban Bhattacharya, P. R. Kumar

Figure 1 for Heteroscedastic Bandits with Reneging
Figure 2 for Heteroscedastic Bandits with Reneging
Viaarxiv icon