Alert button
Picture for Yaosheng Xu

Yaosheng Xu

Alert button

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

Sep 16, 2022
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox

Figure 1 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 2 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 3 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 4 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Viaarxiv icon

Target Entropy Annealing for Discrete Soft Actor-Critic

Dec 06, 2021
Yaosheng Xu, Dailin Hu, Litian Liang, Stephen McAleer, Pieter Abbeel, Roy Fox

Figure 1 for Target Entropy Annealing for Discrete Soft Actor-Critic
Figure 2 for Target Entropy Annealing for Discrete Soft Actor-Critic
Figure 3 for Target Entropy Annealing for Discrete Soft Actor-Critic
Figure 4 for Target Entropy Annealing for Discrete Soft Actor-Critic
Viaarxiv icon

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates

Oct 28, 2021
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox

Figure 1 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 2 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 3 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 4 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Viaarxiv icon