Picture for Haoyuan Sun

Haoyuan Sun

Distribution Preference Optimization: A Fine-grained Perspective for LLM Unlearning

Add code
Oct 06, 2025
Viaarxiv icon

Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models

Add code
May 24, 2025
Viaarxiv icon

In-Context Learning of Polynomial Kernel Regression in Transformers with GLU Layers

Add code
Jan 30, 2025
Viaarxiv icon

Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through $f$-divergence Minimization

Add code
Sep 15, 2024
Figure 1 for Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through $f$-divergence Minimization
Figure 2 for Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through $f$-divergence Minimization
Figure 3 for Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through $f$-divergence Minimization
Figure 4 for Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through $f$-divergence Minimization
Viaarxiv icon

A Method on Searching Better Activation Functions

Add code
May 22, 2024
Figure 1 for A Method on Searching Better Activation Functions
Figure 2 for A Method on Searching Better Activation Functions
Figure 3 for A Method on Searching Better Activation Functions
Figure 4 for A Method on Searching Better Activation Functions
Viaarxiv icon

Improving Offline Reinforcement Learning with Inaccurate Simulators

Add code
May 07, 2024
Figure 1 for Improving Offline Reinforcement Learning with Inaccurate Simulators
Figure 2 for Improving Offline Reinforcement Learning with Inaccurate Simulators
Figure 3 for Improving Offline Reinforcement Learning with Inaccurate Simulators
Figure 4 for Improving Offline Reinforcement Learning with Inaccurate Simulators
Viaarxiv icon

A least-square method for non-asymptotic identification in linear switching control

Add code
Apr 11, 2024
Figure 1 for A least-square method for non-asymptotic identification in linear switching control
Viaarxiv icon

Private Synthetic Data Meets Ensemble Learning

Add code
Oct 15, 2023
Viaarxiv icon

A Unified Approach to Controlling Implicit Regularization via Mirror Descent

Add code
Jun 24, 2023
Figure 1 for A Unified Approach to Controlling Implicit Regularization via Mirror Descent
Figure 2 for A Unified Approach to Controlling Implicit Regularization via Mirror Descent
Figure 3 for A Unified Approach to Controlling Implicit Regularization via Mirror Descent
Figure 4 for A Unified Approach to Controlling Implicit Regularization via Mirror Descent
Viaarxiv icon

Online Learning for Equilibrium Pricing in Markets under Incomplete Information

Add code
Mar 28, 2023
Viaarxiv icon