Alert button
Picture for Quanquan Gu

Quanquan Gu

Alert button

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Jan 02, 2024
Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu

Viaarxiv icon

Sparse PCA with Oracle Property

Dec 28, 2023
Quanquan Gu, Zhaoran Wang, Han Liu

Figure 1 for Sparse PCA with Oracle Property
Viaarxiv icon

Fast Sampling via De-randomization for Discrete Diffusion Models

Dec 14, 2023
Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu

Viaarxiv icon

A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation

Nov 26, 2023
Heyang Zhao, Jiafan He, Quanquan Gu

Viaarxiv icon

Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

Nov 23, 2023
Xuheng Li, Yihe Deng, Jingfeng Wu, Dongruo Zhou, Quanquan Gu

Viaarxiv icon

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Nov 07, 2023
Yihe Deng, Weitong Zhang, Zixiang Chen, Quanquan Gu

Viaarxiv icon

Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data

Oct 29, 2023
Yiwen Kou, Zixiang Chen, Quanquan Gu

Viaarxiv icon

Corruption-Robust Offline Reinforcement Learning with General Function Approximation

Oct 23, 2023
Chenlu Ye, Rui Yang, Quanquan Gu, Tong Zhang

Viaarxiv icon

Pure Exploration in Asynchronous Federated Bandits

Oct 17, 2023
Zichen Wang, Chuanhao Li, Chenyu Song, Lianghui Wang, Quanquan Gu, Huazheng Wang

Viaarxiv icon

How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?

Oct 12, 2023
Jingfeng Wu, Difan Zou, Zixiang Chen, Vladimir Braverman, Quanquan Gu, Peter L. Bartlett

Viaarxiv icon