Picture for Zheng Wen

Zheng Wen

Stochastic Online Learning with Probabilistic Graph Feedback

Add code
Mar 04, 2019
Viaarxiv icon

Scalable Thompson Sampling via Optimal Transport

Add code
Feb 19, 2019
Figure 1 for Scalable Thompson Sampling via Optimal Transport
Figure 2 for Scalable Thompson Sampling via Optimal Transport
Figure 3 for Scalable Thompson Sampling via Optimal Transport
Figure 4 for Scalable Thompson Sampling via Optimal Transport
Viaarxiv icon

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits

Add code
Nov 13, 2018
Figure 1 for Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits
Figure 2 for Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits
Viaarxiv icon

Online Diverse Learning to Rank from Partial-Click Feedback

Add code
Nov 01, 2018
Figure 1 for Online Diverse Learning to Rank from Partial-Click Feedback
Figure 2 for Online Diverse Learning to Rank from Partial-Click Feedback
Figure 3 for Online Diverse Learning to Rank from Partial-Click Feedback
Figure 4 for Online Diverse Learning to Rank from Partial-Click Feedback
Viaarxiv icon

Posterior Sampling for Large Scale Reinforcement Learning

Add code
Oct 22, 2018
Figure 1 for Posterior Sampling for Large Scale Reinforcement Learning
Figure 2 for Posterior Sampling for Large Scale Reinforcement Learning
Figure 3 for Posterior Sampling for Large Scale Reinforcement Learning
Viaarxiv icon

Online Influence Maximization under Independent Cascade Model with Semi-Bandit Feedback

Add code
Jun 19, 2018
Figure 1 for Online Influence Maximization under Independent Cascade Model with Semi-Bandit Feedback
Figure 2 for Online Influence Maximization under Independent Cascade Model with Semi-Bandit Feedback
Figure 3 for Online Influence Maximization under Independent Cascade Model with Semi-Bandit Feedback
Viaarxiv icon

Offline Evaluation of Ranking Policies with Click Models

Add code
Jun 13, 2018
Figure 1 for Offline Evaluation of Ranking Policies with Click Models
Figure 2 for Offline Evaluation of Ranking Policies with Click Models
Figure 3 for Offline Evaluation of Ranking Policies with Click Models
Figure 4 for Offline Evaluation of Ranking Policies with Click Models
Viaarxiv icon

Deep Exploration via Randomized Value Functions

Add code
Jun 06, 2018
Figure 1 for Deep Exploration via Randomized Value Functions
Figure 2 for Deep Exploration via Randomized Value Functions
Figure 3 for Deep Exploration via Randomized Value Functions
Figure 4 for Deep Exploration via Randomized Value Functions
Viaarxiv icon

Conservative Exploration using Interleaving

Add code
Jun 03, 2018
Figure 1 for Conservative Exploration using Interleaving
Viaarxiv icon

Model-Independent Online Learning for Influence Maximization

Add code
May 24, 2018
Figure 1 for Model-Independent Online Learning for Influence Maximization
Figure 2 for Model-Independent Online Learning for Influence Maximization
Figure 3 for Model-Independent Online Learning for Influence Maximization
Viaarxiv icon