Zichuan Lin

PIPCFR: Pseudo-outcome Imputation with Post-treatment Variables for Individual Treatment Effect Estimation

Dec 21, 2025
Via arXiv

EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control

Nov 19, 2025
Via arXiv

Multi-agent In-context Coordination via Decentralized Memory Retrieval

Nov 13, 2025
Via arXiv

CausalMACE: Causality Empowered Multi-Agents in Minecraft Cooperative Tasks

Aug 26, 2025
Via arXiv

Learning Versatile Skills with Curriculum Masking

Oct 23, 2024
Via arXiv

Replay-enhanced Continual Reinforcement Learning

Nov 20, 2023
Via arXiv

Future-conditioned Unsupervised Pretraining for Decision Transformer

May 26, 2023
Via arXiv

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization

Feb 05, 2023
Via arXiv

A Survey on Transformers in Reinforcement Learning

Jan 08, 2023
Via arXiv

Pretraining in Deep Reinforcement Learning: A Survey

Nov 08, 2022
Via arXiv