Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Qingpeng Cai

Softmax Deep Double Deterministic Policy Gradients


Oct 19, 2020
Ling Pan, Qingpeng Cai, Longbo Huang

* NeurIPS 2020 

  Access Paper or Ask Questions

Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce


May 25, 2020
Jianxiong Wei, Anxiang Zeng, Yueqiu Wu, Peng Guo, Qingsong Hua, Qingpeng Cai


  Access Paper or Ask Questions

Multi-Path Policy Optimization


Nov 22, 2019
Ling Pan, Qingpeng Cai, Longbo Huang


  Access Paper or Ask Questions

Deterministic Value-Policy Gradients


Sep 09, 2019
Qingpeng Cai, Ling Pan, Pingzhong Tang


  Access Paper or Ask Questions

Reinforcement Learning Driven Heuristic Optimization


Jun 16, 2019
Qingpeng Cai, Will Hang, Azalia Mirhoseini, George Tucker, Jingtao Wang, Wei Wei

* DRL4KDD'19 

  Access Paper or Ask Questions

Reinforcement Learning with Dynamic Boltzmann Softmax Updates


Mar 15, 2019
Ling Pan, Qingpeng Cai, Qi Meng, Wei Chen, Longbo Huang, Tie-Yan Liu


  Access Paper or Ask Questions

Policy Optimization with Model-based Explorations


Nov 18, 2018
Feiyang Pan, Qingpeng Cai, An-Xiang Zeng, Chun-Xiang Pan, Qing Da, Hualin He, Qing He, Pingzhong Tang

* Accepted at AAAI-19 

  Access Paper or Ask Questions

Deterministic Policy Gradients With General State Transitions


Oct 02, 2018
Qingpeng Cai, Ling Pan, Pingzhong Tang


  Access Paper or Ask Questions

Rebalancing Dockless Bike Sharing Systems


Sep 10, 2018
Ling Pan, Qingpeng Cai, Zhixuan Fang, Pingzhong Tang, Longbo Huang


  Access Paper or Ask Questions

Policy Gradients for General Contextual Bandits


May 22, 2018
Feiyang Pan, Qingpeng Cai, Pingzhong Tang, Fuzhen Zhuang, Qing He


  Access Paper or Ask Questions

Reinforcement Mechanism Design for e-commerce


Feb 27, 2018
Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang, Yiwei Zhang


  Access Paper or Ask Questions