Picture for Yangchen Pan

Yangchen Pan

Data-Free Reservoir Features for Efficient Long-Horizon Cold-Start Continual Learning

Add code
Jun 25, 2026
Viaarxiv icon

NEXUS: Neural Energy Fields for Physically Consistent Contact-Rich 3D Object Dynamics

Add code
Jun 18, 2026
Viaarxiv icon

Temporal Difference Learning for Diffusion Models

Add code
Jun 13, 2026
Viaarxiv icon

Gradient Residual Connections

Add code
Feb 09, 2026
Viaarxiv icon

Measures of Variability for Risk-averse Policy Gradient

Add code
Apr 15, 2025
Viaarxiv icon

PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling

Add code
Feb 04, 2025
Figure 1 for PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
Figure 2 for PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
Figure 3 for PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
Figure 4 for PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
Viaarxiv icon

DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime

Add code
May 28, 2024
Figure 1 for DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime
Figure 2 for DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime
Figure 3 for DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime
Figure 4 for DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime
Viaarxiv icon

Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination

Add code
May 28, 2024
Figure 1 for Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination
Figure 2 for Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination
Figure 3 for Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination
Figure 4 for Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination
Viaarxiv icon

An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models

Add code
Apr 23, 2024
Figure 1 for An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models
Figure 2 for An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models
Figure 3 for An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models
Figure 4 for An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models
Viaarxiv icon

A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization

Add code
Mar 20, 2024
Figure 1 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Figure 2 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Figure 3 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Figure 4 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Viaarxiv icon