Yanwei Ren

LARGO: Low-Rank Regulated Gradient Projection for Robust Parameter Efficient Fine-Tuning

Jun 14, 2025
Via arXiv

SIGMA: Refining Large Language Model Reasoning via Sibling-Guided Monte Carlo Augmentation

Jun 06, 2025
Via arXiv