Picture for Tongzheng Ren

Tongzheng Ren

Nearly Horizon-Free Offline Reinforcement Learning

Add code
Mar 25, 2021
Figure 1 for Nearly Horizon-Free Offline Reinforcement Learning
Figure 2 for Nearly Horizon-Free Offline Reinforcement Learning
Viaarxiv icon

Combinatorial Bandits without Total Order for Arms

Add code
Mar 03, 2021
Figure 1 for Combinatorial Bandits without Total Order for Arms
Figure 2 for Combinatorial Bandits without Total Order for Arms
Figure 3 for Combinatorial Bandits without Total Order for Arms
Figure 4 for Combinatorial Bandits without Total Order for Arms
Viaarxiv icon

Linear Bandit Algorithms with Sublinear Time Complexity

Add code
Mar 03, 2021
Figure 1 for Linear Bandit Algorithms with Sublinear Time Complexity
Viaarxiv icon

Accountable Off-Policy Evaluation With Kernel Bellman Statistics

Add code
Aug 15, 2020
Figure 1 for Accountable Off-Policy Evaluation With Kernel Bellman Statistics
Figure 2 for Accountable Off-Policy Evaluation With Kernel Bellman Statistics
Figure 3 for Accountable Off-Policy Evaluation With Kernel Bellman Statistics
Figure 4 for Accountable Off-Policy Evaluation With Kernel Bellman Statistics
Viaarxiv icon

Stein Self-Repulsive Dynamics: Benefits From Past Samples

Add code
Feb 21, 2020
Figure 1 for Stein Self-Repulsive Dynamics: Benefits From Past Samples
Figure 2 for Stein Self-Repulsive Dynamics: Benefits From Past Samples
Figure 3 for Stein Self-Repulsive Dynamics: Benefits From Past Samples
Figure 4 for Stein Self-Repulsive Dynamics: Benefits From Past Samples
Viaarxiv icon

MaxUp: A Simple Way to Improve Generalization of Neural Network Training

Add code
Feb 20, 2020
Figure 1 for MaxUp: A Simple Way to Improve Generalization of Neural Network Training
Figure 2 for MaxUp: A Simple Way to Improve Generalization of Neural Network Training
Figure 3 for MaxUp: A Simple Way to Improve Generalization of Neural Network Training
Figure 4 for MaxUp: A Simple Way to Improve Generalization of Neural Network Training
Viaarxiv icon

Implicit Regularization of Normalization Methods

Add code
Nov 23, 2019
Figure 1 for Implicit Regularization of Normalization Methods
Figure 2 for Implicit Regularization of Normalization Methods
Figure 3 for Implicit Regularization of Normalization Methods
Figure 4 for Implicit Regularization of Normalization Methods
Viaarxiv icon

Function Space Particle Optimization for Bayesian Neural Networks

Add code
Feb 26, 2019
Figure 1 for Function Space Particle Optimization for Bayesian Neural Networks
Figure 2 for Function Space Particle Optimization for Bayesian Neural Networks
Figure 3 for Function Space Particle Optimization for Bayesian Neural Networks
Figure 4 for Function Space Particle Optimization for Bayesian Neural Networks
Viaarxiv icon

Reward Shaping via Meta-Learning

Add code
Jan 27, 2019
Figure 1 for Reward Shaping via Meta-Learning
Figure 2 for Reward Shaping via Meta-Learning
Figure 3 for Reward Shaping via Meta-Learning
Figure 4 for Reward Shaping via Meta-Learning
Viaarxiv icon

Lazy-CFR: a fast regret minimization algorithm for extensive games with imperfect information

Add code
Oct 10, 2018
Figure 1 for Lazy-CFR: a fast regret minimization algorithm for extensive games with imperfect information
Figure 2 for Lazy-CFR: a fast regret minimization algorithm for extensive games with imperfect information
Viaarxiv icon