Zheng Wen

Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization

Jul 06, 2016

Cascading Bandits for Large-Scale Recommendation Problems

Jun 30, 2016

DCM Bandits: Learning to Rank with Multiple Clicks

May 31, 2016

Generalization and Exploration via Randomized Value Functions

Feb 15, 2016

Combinatorial Cascading Bandits

Nov 17, 2015

Cascading Bandits: Learning to Rank in the Cascade Model

May 18, 2015

Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits

Jan 27, 2015

Learning to Act Greedily: Polymatroid Semi-Bandits

Nov 21, 2014

DUM: Diversity-Weighted Utility Maximization for Recommendations

Nov 13, 2014

Optimal Demand Response Using Device Based Reinforcement Learning

Jun 28, 2014