Yanchao Sun

Checklists Are Better Than Reward Models For Aligning Language Models

Jul 24, 2025

Interleaved Reasoning for Large Language Models via Reinforcement Learning

May 26, 2025

Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory

Nov 01, 2024

TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights

Oct 06, 2024

Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies

Feb 20, 2024

Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models

Feb 05, 2024

O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models

Oct 22, 2023

Robustness to Multi-Modal Environment Uncertainty in MARL using Curriculum Learning

Oct 12, 2023

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

Oct 11, 2023

Learning Generalizable Agents via Saliency-Guided Features Decorrelation

Oct 08, 2023