Yuang Cai

$\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models

Feb 13, 2026
Via arXiv

Label-Confidence-Aware Uncertainty Estimation in Natural Language Generation

Dec 10, 2024
Via arXiv

Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment

Nov 14, 2024
Via arXiv