Alert button
Picture for Yunhao Tang

Yunhao Tang

Alert button

Unlocking Pixels for Reinforcement Learning via Implicit Attention

Add code
Bookmark button
Alert button
Feb 08, 2021
Krzysztof Choromanski, Deepali Jain, Jack Parker-Holder, Xingyou Song, Valerii Likhosherstov, Anirban Santara, Aldo Pacchiano, Yunhao Tang, Adrian Weller

Figure 1 for Unlocking Pixels for Reinforcement Learning via Implicit Attention
Figure 2 for Unlocking Pixels for Reinforcement Learning via Implicit Attention
Figure 3 for Unlocking Pixels for Reinforcement Learning via Implicit Attention
Figure 4 for Unlocking Pixels for Reinforcement Learning via Implicit Attention
Viaarxiv icon

ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 19, 2021
Xingyou Song, Krzysztof Choromanski, Jack Parker-Holder, Yunhao Tang, Daiyi Peng, Deepali Jain, Wenbo Gao, Aldo Pacchiano, Tamas Sarlos, Yuxiang Yang

Figure 1 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning
Figure 2 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning
Figure 3 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning
Figure 4 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning
Viaarxiv icon

Monte-Carlo Tree Search as Regularized Policy Optimization

Add code
Bookmark button
Alert button
Jul 24, 2020
Jean-Bastien Grill, Florent Altché, Yunhao Tang, Thomas Hubert, Michal Valko, Ioannis Antonoglou, Rémi Munos

Figure 1 for Monte-Carlo Tree Search as Regularized Policy Optimization
Figure 2 for Monte-Carlo Tree Search as Regularized Policy Optimization
Figure 3 for Monte-Carlo Tree Search as Regularized Policy Optimization
Figure 4 for Monte-Carlo Tree Search as Regularized Policy Optimization
Viaarxiv icon

Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies

Add code
Bookmark button
Alert button
Jun 13, 2020
Yunhao Tang, Krzysztof Choromanski

Figure 1 for Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Figure 2 for Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Figure 3 for Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Figure 4 for Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Viaarxiv icon

Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 13, 2020
Yunhao Tang, Alp Kucukelbir

Figure 1 for Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning
Figure 2 for Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning
Figure 3 for Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning
Figure 4 for Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning
Viaarxiv icon

Self-Imitation Learning via Generalized Lower Bound Q-learning

Add code
Bookmark button
Alert button
Jun 12, 2020
Yunhao Tang

Figure 1 for Self-Imitation Learning via Generalized Lower Bound Q-learning
Figure 2 for Self-Imitation Learning via Generalized Lower Bound Q-learning
Figure 3 for Self-Imitation Learning via Generalized Lower Bound Q-learning
Figure 4 for Self-Imitation Learning via Generalized Lower Bound Q-learning
Viaarxiv icon

Taylor Expansion Policy Optimization

Add code
Bookmark button
Alert button
Mar 13, 2020
Yunhao Tang, Michal Valko, Rémi Munos

Figure 1 for Taylor Expansion Policy Optimization
Figure 2 for Taylor Expansion Policy Optimization
Figure 3 for Taylor Expansion Policy Optimization
Figure 4 for Taylor Expansion Policy Optimization
Viaarxiv icon

Discrete Action On-Policy Learning with Action-Value Critic

Add code
Bookmark button
Alert button
Feb 21, 2020
Yuguang Yue, Yunhao Tang, Mingzhang Yin, Mingyuan Zhou

Figure 1 for Discrete Action On-Policy Learning with Action-Value Critic
Figure 2 for Discrete Action On-Policy Learning with Action-Value Critic
Figure 3 for Discrete Action On-Policy Learning with Action-Value Critic
Figure 4 for Discrete Action On-Policy Learning with Action-Value Critic
Viaarxiv icon

ES-MAML: Simple Hessian-Free Meta Learning

Add code
Bookmark button
Alert button
Oct 05, 2019
Xingyou Song, Wenbo Gao, Yuxiang Yang, Krzysztof Choromanski, Aldo Pacchiano, Yunhao Tang

Figure 1 for ES-MAML: Simple Hessian-Free Meta Learning
Figure 2 for ES-MAML: Simple Hessian-Free Meta Learning
Figure 3 for ES-MAML: Simple Hessian-Free Meta Learning
Figure 4 for ES-MAML: Simple Hessian-Free Meta Learning
Viaarxiv icon