Picture for Jiancong Xiao

Jiancong Xiao

Fundamental Limits of Game-Theoretic LLM Alignment: Smith Consistency and Preference Matching

Add code
May 27, 2025
Viaarxiv icon

Restoring Calibration for Aligned Large Language Models: A Calibration-Aware Fine-Tuning Approach

Add code
May 04, 2025
Viaarxiv icon

Statistical Impossibility and Possibility of Aligning LLMs with Human Preferences: From Condorcet Paradox to Nash Equilibrium

Add code
Mar 14, 2025
Viaarxiv icon

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment

Add code
Oct 22, 2024
Figure 1 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Figure 2 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Figure 3 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Figure 4 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Viaarxiv icon

Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

Add code
Aug 29, 2024
Viaarxiv icon

Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic

Add code
Jul 09, 2024
Viaarxiv icon

Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization

Add code
Jun 08, 2024
Viaarxiv icon

On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization

Add code
May 26, 2024
Viaarxiv icon

Uniformly Stable Algorithms for Adversarial Training and Beyond

Add code
May 03, 2024
Figure 1 for Uniformly Stable Algorithms for Adversarial Training and Beyond
Figure 2 for Uniformly Stable Algorithms for Adversarial Training and Beyond
Figure 3 for Uniformly Stable Algorithms for Adversarial Training and Beyond
Figure 4 for Uniformly Stable Algorithms for Adversarial Training and Beyond
Viaarxiv icon

PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization

Add code
Oct 09, 2023
Figure 1 for PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization
Figure 2 for PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization
Figure 3 for PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization
Viaarxiv icon