Picture for Mingye Zhu

Mingye Zhu

Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models

Add code
May 26, 2025
Viaarxiv icon

Leveraging Robust Optimization for LLM Alignment under Distribution Shifts

Add code
Apr 08, 2025
Viaarxiv icon

On-the-fly Preference Alignment via Principle-Guided Decoding

Add code
Feb 20, 2025
Viaarxiv icon

FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization

Add code
Oct 01, 2024
Figure 1 for FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization
Figure 2 for FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization
Figure 3 for FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization
Figure 4 for FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization
Viaarxiv icon

LIRE: listwise reward enhancement for preference alignment

Add code
May 22, 2024
Figure 1 for LIRE: listwise reward enhancement for preference alignment
Figure 2 for LIRE: listwise reward enhancement for preference alignment
Figure 3 for LIRE: listwise reward enhancement for preference alignment
Figure 4 for LIRE: listwise reward enhancement for preference alignment
Viaarxiv icon

SAGE-NDVI: A Stereotype-Breaking Evaluation Metric for Remote Sensing Image Dehazing Using Satellite-to-Ground NDVI Knowledge

Add code
Jun 09, 2023
Viaarxiv icon

Leveraging Probabilistic Circuits for Nonparametric Multi-Output Regression

Add code
Jun 16, 2021
Figure 1 for Leveraging Probabilistic Circuits for Nonparametric Multi-Output Regression
Figure 2 for Leveraging Probabilistic Circuits for Nonparametric Multi-Output Regression
Figure 3 for Leveraging Probabilistic Circuits for Nonparametric Multi-Output Regression
Figure 4 for Leveraging Probabilistic Circuits for Nonparametric Multi-Output Regression
Viaarxiv icon