Picture for Xiaolin Sun

Xiaolin Sun

Robust Optimization for Mitigating Reward Hacking with Correlated Proxies

Add code
Apr 13, 2026
Viaarxiv icon

Diffusion Guided Adversarial State Perturbations in Reinforcement Learning

Add code
Nov 10, 2025
Figure 1 for Diffusion Guided Adversarial State Perturbations in Reinforcement Learning
Figure 2 for Diffusion Guided Adversarial State Perturbations in Reinforcement Learning
Figure 3 for Diffusion Guided Adversarial State Perturbations in Reinforcement Learning
Figure 4 for Diffusion Guided Adversarial State Perturbations in Reinforcement Learning
Viaarxiv icon

Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations

Add code
Mar 06, 2024
Figure 1 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Figure 2 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Figure 3 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Figure 4 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Viaarxiv icon

Enhancing LLM Safety via Constrained Direct Preference Optimization

Add code
Mar 04, 2024
Figure 1 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 2 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 3 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 4 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Viaarxiv icon

Pandering in a Flexible Representative Democracy

Add code
Nov 18, 2022
Figure 1 for Pandering in a Flexible Representative Democracy
Figure 2 for Pandering in a Flexible Representative Democracy
Figure 3 for Pandering in a Flexible Representative Democracy
Viaarxiv icon

An exact solution in Markov decision process with multiplicative rewards as a general framework

Add code
Dec 15, 2020
Viaarxiv icon

Leveraging Legacy Data to Accelerate Materials Design via Preference Learning

Add code
Oct 25, 2019
Figure 1 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Figure 2 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Figure 3 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Figure 4 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Viaarxiv icon