Alert button
Picture for Cyril Zhang

Cyril Zhang

Alert button

Disentangling Adaptive Gradient Methods from Learning Rates

Feb 26, 2020
Naman Agarwal, Rohan Anil, Elad Hazan, Tomer Koren, Cyril Zhang

Figure 1 for Disentangling Adaptive Gradient Methods from Learning Rates
Figure 2 for Disentangling Adaptive Gradient Methods from Learning Rates
Figure 3 for Disentangling Adaptive Gradient Methods from Learning Rates
Figure 4 for Disentangling Adaptive Gradient Methods from Learning Rates
Viaarxiv icon

No-Regret Prediction in Marginally Stable Systems

Feb 20, 2020
Udaya Ghai, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang

Viaarxiv icon

Calibration, Entropy Rates, and Memory in Language Models

Jun 11, 2019
Mark Braverman, Xinyi Chen, Sham M. Kakade, Karthik Narasimhan, Cyril Zhang, Yi Zhang

Figure 1 for Calibration, Entropy Rates, and Memory in Language Models
Figure 2 for Calibration, Entropy Rates, and Memory in Language Models
Figure 3 for Calibration, Entropy Rates, and Memory in Language Models
Figure 4 for Calibration, Entropy Rates, and Memory in Language Models
Viaarxiv icon

Robust guarantees for learning an autoregressive filter

May 23, 2019
Holden Lee, Cyril Zhang

Viaarxiv icon

Extreme Tensoring for Low-Memory Preconditioning

Feb 12, 2019
Xinyi Chen, Naman Agarwal, Elad Hazan, Cyril Zhang, Yi Zhang

Figure 1 for Extreme Tensoring for Low-Memory Preconditioning
Figure 2 for Extreme Tensoring for Low-Memory Preconditioning
Figure 3 for Extreme Tensoring for Low-Memory Preconditioning
Figure 4 for Extreme Tensoring for Low-Memory Preconditioning
Viaarxiv icon

The Case for Full-Matrix Adaptive Regularization

Jun 08, 2018
Naman Agarwal, Brian Bullins, Xinyi Chen, Elad Hazan, Karan Singh, Cyril Zhang, Yi Zhang

Figure 1 for The Case for Full-Matrix Adaptive Regularization
Figure 2 for The Case for Full-Matrix Adaptive Regularization
Figure 3 for The Case for Full-Matrix Adaptive Regularization
Figure 4 for The Case for Full-Matrix Adaptive Regularization
Viaarxiv icon

Not-So-Random Features

Feb 27, 2018
Brian Bullins, Cyril Zhang, Yi Zhang

Figure 1 for Not-So-Random Features
Figure 2 for Not-So-Random Features
Viaarxiv icon

Spectral Filtering for General Linear Dynamical Systems

Feb 12, 2018
Elad Hazan, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang

Figure 1 for Spectral Filtering for General Linear Dynamical Systems
Viaarxiv icon

Learning Linear Dynamical Systems via Spectral Filtering

Nov 02, 2017
Elad Hazan, Karan Singh, Cyril Zhang

Figure 1 for Learning Linear Dynamical Systems via Spectral Filtering
Figure 2 for Learning Linear Dynamical Systems via Spectral Filtering
Viaarxiv icon