Picture for Yuzhu Chen

Yuzhu Chen

Generalisation of RLHF under Reward Shift and Clipped KL Regularisation

Add code
Feb 25, 2026
Viaarxiv icon

Drawback of Enforcing Equivariance and its Compensation via the Lens of Expressive Power

Add code
Dec 10, 2025
Viaarxiv icon

CoVeR: Conformal Calibration for Versatile and Reliable Autoregressive Next-Token Prediction

Add code
Sep 05, 2025
Figure 1 for CoVeR: Conformal Calibration for Versatile and Reliable Autoregressive Next-Token Prediction
Figure 2 for CoVeR: Conformal Calibration for Versatile and Reliable Autoregressive Next-Token Prediction
Viaarxiv icon

A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops

Add code
Feb 26, 2025
Figure 1 for A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops
Viaarxiv icon

HRP: High-Rank Preheating for Superior LoRA Initialization

Add code
Feb 11, 2025
Viaarxiv icon

On Championing Foundation Models: From Explainability to Interpretability

Add code
Oct 15, 2024
Viaarxiv icon