Alert button
Picture for Xiaolin Sun

Xiaolin Sun

Alert button

Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations

Add code
Bookmark button
Alert button
Mar 06, 2024
Xiaolin Sun, Zizhan Zheng

Figure 1 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Figure 2 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Figure 3 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Figure 4 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Viaarxiv icon

Enhancing LLM Safety via Constrained Direct Preference Optimization

Add code
Bookmark button
Alert button
Mar 04, 2024
Zixuan Liu, Xiaolin Sun, Zizhan Zheng

Figure 1 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 2 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 3 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 4 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Viaarxiv icon

Pandering in a Flexible Representative Democracy

Add code
Bookmark button
Alert button
Nov 18, 2022
Xiaolin Sun, Jacob Masur, Ben Abramowitz, Nicholas Mattei, Zizhan Zheng

Figure 1 for Pandering in a Flexible Representative Democracy
Figure 2 for Pandering in a Flexible Representative Democracy
Figure 3 for Pandering in a Flexible Representative Democracy
Figure 4 for Pandering in a Flexible Representative Democracy
Viaarxiv icon

An exact solution in Markov decision process with multiplicative rewards as a general framework

Add code
Bookmark button
Alert button
Dec 15, 2020
Yuan Yao, Xiaolin Sun

Viaarxiv icon

Leveraging Legacy Data to Accelerate Materials Design via Preference Learning

Add code
Bookmark button
Alert button
Oct 25, 2019
Xiaolin Sun, Zhufeng Hou, Masato Sumita, Shinsuke Ishihara, Ryo Tamura, Koji Tsuda

Figure 1 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Figure 2 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Figure 3 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Figure 4 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Viaarxiv icon