Picture for Bingcong Li

Bingcong Li

Adaptive Test-Time Compute Allocation for Reasoning LLMs via Constrained Policy Optimization

Add code
Apr 16, 2026
Viaarxiv icon

Zeroth-Order Optimization at the Edge of Stability

Add code
Apr 16, 2026
Viaarxiv icon

Binomial Gradient-Based Meta-Learning for Enhanced Meta-Gradient Estimation

Add code
Apr 14, 2026
Viaarxiv icon

Scalable Variational Bayesian Fine-Tuning of LLMs via Orthogonalized Low-Rank Adapters

Add code
Apr 03, 2026
Viaarxiv icon

ANCRe: Adaptive Neural Connection Reassignment for Efficient Depth Scaling

Add code
Feb 09, 2026
Viaarxiv icon

SALAAD: Sparse And Low-Rank Adaptation via ADMM

Add code
Feb 01, 2026
Viaarxiv icon

Zeroth-Order Optimization Finds Flat Minima

Add code
Jun 05, 2025
Viaarxiv icon

RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models

Add code
May 24, 2025
Viaarxiv icon

Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective

Add code
Feb 26, 2025
Viaarxiv icon

Preconditioned Sharpness-Aware Minimization: Unifying Analysis and a Novel Learning Algorithm

Add code
Jan 11, 2025
Figure 1 for Preconditioned Sharpness-Aware Minimization: Unifying Analysis and a Novel Learning Algorithm
Figure 2 for Preconditioned Sharpness-Aware Minimization: Unifying Analysis and a Novel Learning Algorithm
Figure 3 for Preconditioned Sharpness-Aware Minimization: Unifying Analysis and a Novel Learning Algorithm
Figure 4 for Preconditioned Sharpness-Aware Minimization: Unifying Analysis and a Novel Learning Algorithm
Viaarxiv icon