Yu-Yang Qian

TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Similarity Tree

Jun 12, 2025
Via arXiv

Provably Efficient RLHF Pipeline: A Unified View from Contextual Bandits

Feb 11, 2025
Via arXiv