Zheng Wen

Structured Policy Iteration for Linear Quadratic Regulator

Jul 13, 2020
Youngsuk Park, Ryan A. Rossi, Zheng Wen, Gang Wu, Handong Zhao

Via arXiv

Influence Diagram Bandits: Variational Thompson Sampling for Structured Bandit Problems

Jul 09, 2020
Tong Yu, Branislav Kveton, Zheng Wen, Ruiyi Zhang, Ole J. Mengshoel

Via arXiv

Hypermodels for Exploration

Jun 12, 2020
Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Ian Osband, Zheng Wen, Benjamin Van Roy

Via arXiv

Improving Adversarial Text Generation by Modeling the Distant Future

May 04, 2020
Ruiyi Zhang, Changyou Chen, Zhe Gan, Wenlin Wang, Dinghan Shen, Guoyin Wang, Zheng Wen, Lawrence Carin

Via arXiv

Nested-Wasserstein Self-Imitation Learning for Sequence Generation

Jan 20, 2020
Ruiyi Zhang, Changyou Chen, Zhe Gan, Zheng Wen, Wenlin Wang, Lawrence Carin

Via arXiv

Bootstrapping Upper Confidence Bound

Jul 23, 2019
Botao Hao, Yasin Abbasi-Yadkori, Zheng Wen, Guang Cheng

Via arXiv

Waterfall Bandits: Learning to Sell Ads Online

Apr 20, 2019
Branislav Kveton, Saied Mahdian, S. Muthukrishnan, Zheng Wen, Yikun Xian

Via arXiv

Stochastic Online Learning with Probabilistic Graph Feedback

Mar 04, 2019
Shuai Li, Wei Chen, Zheng Wen, Kwong-Sak Leung

Via arXiv

Scalable Thompson Sampling via Optimal Transport

Feb 19, 2019
Ruiyi Zhang, Zheng Wen, Changyou Chen, Lawrence Carin

Via arXiv