Dailin Hu

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

Sep 16, 2022
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox

Via arXiv

Target Entropy Annealing for Discrete Soft Actor-Critic

Dec 06, 2021
Yaosheng Xu, Dailin Hu, Litian Liang, Stephen McAleer, Pieter Abbeel, Roy Fox

Via arXiv

Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning

Nov 28, 2021
Dailin Hu, Pieter Abbeel, Roy Fox

Via arXiv

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates

Oct 28, 2021
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox

Via arXiv