Picture for Shujian Zhang

Shujian Zhang

Optimized scheduling of electricity-heat cooperative system considering wind energy consumption and peak shaving and valley filling

Add code
Nov 19, 2025
Viaarxiv icon

Principled Foundations for Preference Optimization

Add code
Jul 10, 2025
Viaarxiv icon

T-REG: Preference Optimization with Token-Level Reward Regularization

Add code
Dec 03, 2024
Viaarxiv icon

Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy

Add code
Oct 09, 2024
Figure 1 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 2 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 3 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 4 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Viaarxiv icon

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe

Add code
Oct 07, 2024
Figure 1 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 2 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 3 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 4 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Viaarxiv icon

Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models

Add code
Sep 17, 2024
Figure 1 for Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Figure 2 for Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Figure 3 for Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Figure 4 for Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Viaarxiv icon

WPO: Enhancing RLHF with Weighted Preference Optimization

Add code
Jun 17, 2024
Viaarxiv icon

Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts

Add code
May 23, 2024
Figure 1 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Figure 2 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Figure 3 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Figure 4 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Viaarxiv icon

Switchable Decision: Dynamic Neural Generation Networks

Add code
May 07, 2024
Viaarxiv icon

Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows

Add code
Mar 25, 2024
Viaarxiv icon