Alert button
Picture for Zhao Song

Zhao Song

Alert button

Attention is Naturally Sparse with Gaussian Distributed Input

Add code
Bookmark button
Alert button
Apr 03, 2024
Yichuan Deng, Zhao Song, Chiwun Yang

Viaarxiv icon

On Computational Limits of Modern Hopfield Models: A Fine-Grained Complexity Analysis

Add code
Bookmark button
Alert button
Feb 14, 2024
Jerry Yao-Chieh Hu, Thomas Lin, Zhao Song, Han Liu

Viaarxiv icon

Fourier Circuits in Neural Networks: Unlocking the Potential of Large Language Models in Mathematical Reasoning and Modular Arithmetic

Add code
Bookmark button
Alert button
Feb 12, 2024
Jiuxiang Gu, Chenyang Li, Yingyu Liang, Zhenmei Shi, Zhao Song, Tianyi Zhou

Viaarxiv icon

Quantum Speedup for Spectral Approximation of Kronecker Products

Add code
Bookmark button
Alert button
Feb 10, 2024
Yeqi Gao, Zhao Song, Ruizhe Zhang

Viaarxiv icon

The Fine-Grained Complexity of Gradient Computation for Training Large Language Models

Add code
Bookmark button
Alert button
Feb 07, 2024
Josh Alman, Zhao Song

Viaarxiv icon

Enhancing Stochastic Gradient Descent: A Unified Framework and Novel Acceleration Methods for Faster Convergence

Add code
Bookmark button
Alert button
Feb 02, 2024
Yichuan Deng, Zhao Song, Chiwun Yang

Viaarxiv icon

Local Convergence of Approximate Newton Method for Two Layer Nonlinear Regression

Add code
Bookmark button
Alert button
Nov 26, 2023
Zhihang Li, Zhao Song, Zifan Wang, Junze Yin

Viaarxiv icon

Revisiting Quantum Algorithms for Linear Regressions: Quadratic Speedups without Data-Dependent Parameters

Add code
Bookmark button
Alert button
Nov 24, 2023
Zhao Song, Junze Yin, Ruizhe Zhang

Viaarxiv icon

One Pass Streaming Algorithm for Super Long Token Attention Approximation in Sublinear Space

Add code
Bookmark button
Alert button
Nov 24, 2023
Raghav Addanki, Chenyang Li, Zhao Song, Chiwun Yang

Viaarxiv icon