Picture for Mehrdad Farajtabar

Mehrdad Farajtabar

A maximum-entropy approach to off-policy evaluation in average-reward MDPs

Add code
Jun 17, 2020
Figure 1 for A maximum-entropy approach to off-policy evaluation in average-reward MDPs
Figure 2 for A maximum-entropy approach to off-policy evaluation in average-reward MDPs
Viaarxiv icon

Understanding the Role of Training Regimes in Continual Learning

Add code
Jun 12, 2020
Figure 1 for Understanding the Role of Training Regimes in Continual Learning
Figure 2 for Understanding the Role of Training Regimes in Continual Learning
Figure 3 for Understanding the Role of Training Regimes in Continual Learning
Figure 4 for Understanding the Role of Training Regimes in Continual Learning
Viaarxiv icon

Learning to Incentivize Other Learning Agents

Add code
Jun 10, 2020
Figure 1 for Learning to Incentivize Other Learning Agents
Figure 2 for Learning to Incentivize Other Learning Agents
Figure 3 for Learning to Incentivize Other Learning Agents
Figure 4 for Learning to Incentivize Other Learning Agents
Viaarxiv icon

Dropout as an Implicit Gating Mechanism For Continual Learning

Add code
Apr 24, 2020
Figure 1 for Dropout as an Implicit Gating Mechanism For Continual Learning
Figure 2 for Dropout as an Implicit Gating Mechanism For Continual Learning
Figure 3 for Dropout as an Implicit Gating Mechanism For Continual Learning
Figure 4 for Dropout as an Implicit Gating Mechanism For Continual Learning
Viaarxiv icon

Self-Distillation Amplifies Regularization in Hilbert Space

Add code
Feb 25, 2020
Figure 1 for Self-Distillation Amplifies Regularization in Hilbert Space
Figure 2 for Self-Distillation Amplifies Regularization in Hilbert Space
Figure 3 for Self-Distillation Amplifies Regularization in Hilbert Space
Figure 4 for Self-Distillation Amplifies Regularization in Hilbert Space
Viaarxiv icon

Orthogonal Gradient Descent for Continual Learning

Add code
Oct 15, 2019
Figure 1 for Orthogonal Gradient Descent for Continual Learning
Figure 2 for Orthogonal Gradient Descent for Continual Learning
Figure 3 for Orthogonal Gradient Descent for Continual Learning
Figure 4 for Orthogonal Gradient Descent for Continual Learning
Viaarxiv icon

Cross-View Policy Learning for Street Navigation

Add code
Jun 13, 2019
Figure 1 for Cross-View Policy Learning for Street Navigation
Figure 2 for Cross-View Policy Learning for Street Navigation
Figure 3 for Cross-View Policy Learning for Street Navigation
Figure 4 for Cross-View Policy Learning for Street Navigation
Viaarxiv icon

Improved Knowledge Distillation via Teacher Assistant: Bridging the Gap Between Student and Teacher

Add code
Feb 09, 2019
Figure 1 for Improved Knowledge Distillation via Teacher Assistant: Bridging the Gap Between Student and Teacher
Figure 2 for Improved Knowledge Distillation via Teacher Assistant: Bridging the Gap Between Student and Teacher
Figure 3 for Improved Knowledge Distillation via Teacher Assistant: Bridging the Gap Between Student and Teacher
Figure 4 for Improved Knowledge Distillation via Teacher Assistant: Bridging the Gap Between Student and Teacher
Viaarxiv icon

More Robust Doubly Robust Off-policy Evaluation

Add code
May 23, 2018
Figure 1 for More Robust Doubly Robust Off-policy Evaluation
Figure 2 for More Robust Doubly Robust Off-policy Evaluation
Figure 3 for More Robust Doubly Robust Off-policy Evaluation
Figure 4 for More Robust Doubly Robust Off-policy Evaluation
Viaarxiv icon

Representation Learning over Dynamic Graphs

Add code
Mar 16, 2018
Figure 1 for Representation Learning over Dynamic Graphs
Figure 2 for Representation Learning over Dynamic Graphs
Figure 3 for Representation Learning over Dynamic Graphs
Figure 4 for Representation Learning over Dynamic Graphs
Viaarxiv icon