Alert button
Picture for Zixiang Chen

Zixiang Chen

Alert button

Matching the Statistical Query Lower Bound for k-sparse Parity Problems with Stochastic Gradient Descent

Add code
Bookmark button
Alert button
Apr 18, 2024
Yiwen Kou, Zixiang Chen, Quanquan Gu, Sham M. Kakade

Viaarxiv icon

Guided Discrete Diffusion for Electronic Health Record Generation

Add code
Bookmark button
Alert button
Apr 18, 2024
Zixiang Chen, Jun Han, Yongqian Li, Yiwen Kou, Eran Halperin, Robert E. Tillman, Quanquan Gu

Viaarxiv icon

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Add code
Bookmark button
Alert button
Feb 15, 2024
Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu

Viaarxiv icon

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Add code
Bookmark button
Alert button
Jan 02, 2024
Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu

Viaarxiv icon

Fast Sampling via De-randomization for Discrete Diffusion Models

Add code
Bookmark button
Alert button
Dec 14, 2023
Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu

Viaarxiv icon

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Add code
Bookmark button
Alert button
Nov 07, 2023
Yihe Deng, Weitong Zhang, Zixiang Chen, Quanquan Gu

Viaarxiv icon

Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data

Add code
Bookmark button
Alert button
Oct 29, 2023
Yiwen Kou, Zixiang Chen, Quanquan Gu

Viaarxiv icon

How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?

Add code
Bookmark button
Alert button
Oct 12, 2023
Jingfeng Wu, Difan Zou, Zixiang Chen, Vladimir Braverman, Quanquan Gu, Peter L. Bartlett

Viaarxiv icon

Why Does Sharpness-Aware Minimization Generalize Better Than SGD?

Add code
Bookmark button
Alert button
Oct 11, 2023
Zixiang Chen, Junkai Zhang, Yiwen Kou, Xiangning Chen, Cho-Jui Hsieh, Quanquan Gu

Figure 1 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Figure 2 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Figure 3 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Figure 4 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Viaarxiv icon

Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP

Add code
Bookmark button
Alert button
Oct 02, 2023
Zixiang Chen, Yihe Deng, Yuanzhi Li, Quanquan Gu

Figure 1 for Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
Figure 2 for Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
Figure 3 for Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
Figure 4 for Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
Viaarxiv icon