Yichuan Deng

Attention is Naturally Sparse with Gaussian Distributed Input

Apr 03, 2024
Yichuan Deng, Zhao Song, Chiwun Yang

Enhancing Stochastic Gradient Descent: A Unified Framework and Novel Acceleration Methods for Faster Convergence

Feb 02, 2024
Yichuan Deng, Zhao Song, Chiwun Yang

Unmasking Transformers: A Theoretical Approach to Data Recovery via Attention Weights

Oct 19, 2023
Yichuan Deng, Zhao Song, Shenghao Xie, Chiwun Yang

Superiority of Softmax: Unveiling the Performance Edge Over Linear Attention

Oct 18, 2023
Yichuan Deng, Zhao Song, Tianyi Zhou

Clustered Linear Contextual Bandits with Knapsacks

Aug 21, 2023
Yichuan Deng, Michalis Mamakos, Zhao Song

Convergence of Two-Layer Regression with Nonlinear Units

Aug 16, 2023
Yichuan Deng, Zhao Song, Shenghao Xie

Zero-th Order Algorithm for Softmax Attention Optimization

Jul 17, 2023
Yichuan Deng, Zhihang Li, Sridhar Mahadevan, Zhao Song

Faster Robust Tensor Power Method for Arbitrary Order

Jun 01, 2023
Yichuan Deng, Zhao Song, Junze Yin

Attention Scheme Inspired Softmax Regression

Apr 26, 2023
Yichuan Deng, Zhihang Li, Zhao Song
