Picture for Zongzhang Zhang

Zongzhang Zhang

Hindsight Preference Learning for Offline Preference-based Reinforcement Learning

Add code
Jul 05, 2024
Viaarxiv icon

Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models

Add code
Jul 04, 2024
Figure 1 for Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
Figure 2 for Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
Figure 3 for Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
Figure 4 for Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
Viaarxiv icon

Q-Adapter: Training Your LLM Adapter as a Residual Q-Function

Add code
Jul 04, 2024
Figure 1 for Q-Adapter: Training Your LLM Adapter as a Residual Q-Function
Figure 2 for Q-Adapter: Training Your LLM Adapter as a Residual Q-Function
Figure 3 for Q-Adapter: Training Your LLM Adapter as a Residual Q-Function
Figure 4 for Q-Adapter: Training Your LLM Adapter as a Residual Q-Function
Viaarxiv icon

$\text{Alpha}^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning

Add code
Jun 26, 2024
Viaarxiv icon

Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation

Add code
Mar 12, 2024
Figure 1 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 2 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 3 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 4 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Viaarxiv icon

Reinforced In-Context Black-Box Optimization

Add code
Feb 27, 2024
Viaarxiv icon

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics

Add code
Feb 17, 2024
Viaarxiv icon

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

Add code
Dec 26, 2023
Figure 1 for Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
Figure 2 for Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
Figure 3 for Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
Figure 4 for Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
Viaarxiv icon

Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments

Add code
Oct 09, 2023
Figure 1 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Figure 2 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Figure 3 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Figure 4 for Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Viaarxiv icon

ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning

Add code
Sep 12, 2023
Viaarxiv icon