Picture for Ruqi Zhang

Ruqi Zhang

Stacey: Promoting Stochastic Steepest Descent via Accelerated $\ell_p$-Smooth Nonconvex Optimization

Add code
Jun 07, 2025
Viaarxiv icon

Inference Acceleration of Autoregressive Normalizing Flows by Selective Jacobi Decoding

Add code
May 30, 2025
Viaarxiv icon

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Add code
May 28, 2025
Viaarxiv icon

Entropy-Guided Sampling of Flat Modes in Discrete Spaces

Add code
May 05, 2025
Viaarxiv icon

Energy-Based Reward Models for Robust Language Model Alignment

Add code
Apr 17, 2025
Viaarxiv icon

More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment

Add code
Apr 03, 2025
Viaarxiv icon

Reheated Gradient-based Discrete Sampling for Combinatorial Optimization

Add code
Mar 06, 2025
Viaarxiv icon

Optimal Stochastic Trace Estimation in Generative Modeling

Add code
Feb 26, 2025
Viaarxiv icon

Bayesian Computation in Deep Learning

Add code
Feb 26, 2025
Viaarxiv icon

CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought

Add code
Feb 24, 2025
Viaarxiv icon