Ruijie Zheng

ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

Feb 22, 2024
Via arXiv

PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem

Feb 16, 2024
Via arXiv

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

Feb 13, 2024
Via arXiv

DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization

Oct 30, 2023
Via arXiv

Progressively Efficient Learning

Oct 13, 2023
Via arXiv

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

Oct 11, 2023
Via arXiv

Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making

Sep 07, 2023
Via arXiv

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations

Jul 22, 2023
Via arXiv

Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training

Jul 18, 2023
Via arXiv

TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning

Jun 22, 2023
Via arXiv