Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games


Aug 19, 2022
Rong-Jun Qin, Fan-Ming Luo, Hong Qian, Yang Yu


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

BBTv2: Pure Black-Box Optimization Can Be Comparable to Gradient Descent for Few-Shot Learning


May 23, 2022
Tianxiang Sun, Zhengfu He, Hong Qian, Xuanjing Huang, Xipeng Qiu

* Work in progress. Code is publicly available at https://github.com/txsun1997/Black-Box-Tuning 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Black-Box Tuning for Language-Model-as-a-Service


Feb 08, 2022
Tianxiang Sun, Yunfan Shao, Hong Qian, Xuanjing Huang, Xipeng Qiu

* 14 pages. Code is available at https://github.com/txsun1997/Black-Box-Tuning 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Derivative-Free Reinforcement Learning: A Review


Feb 10, 2021
Hong Qian, Yang Yu

* This article has been accepted by Frontiers of Computer Science in 2020 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

ZOOpt: Toolbox for Derivative-Free Optimization


Feb 06, 2018
Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Yang Yu, Chao Qian


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Estimate exponential memory decay in Hidden Markov Model and its applications


Oct 17, 2017
Felix X. -F. Ye, Yi-an Ma, Hong Qian


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

The Sampling-and-Learning Framework: A Statistical View of Evolutionary Algorithms


Apr 11, 2014
Yang Yu, Hong Qian


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email