Alert button
Picture for Molei Tao

Molei Tao

Alert button

Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport

Add code
Bookmark button
Alert button
May 27, 2022
Lingkai Kong, Yuqing Wang, Molei Tao

Figure 1 for Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport
Figure 2 for Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport
Figure 3 for Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport
Figure 4 for Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport
Viaarxiv icon

The Mirror Langevin Algorithm Converges with Vanishing Bias

Add code
Bookmark button
Alert button
Oct 11, 2021
Ruilin Li, Molei Tao, Santosh S. Vempala, Andre Wibisono

Viaarxiv icon

Large Learning Rate Tames Homogeneity: Convergence and Balancing Effect

Add code
Bookmark button
Alert button
Oct 07, 2021
Yuqing Wang, Minshuo Chen, Tuo Zhao, Molei Tao

Figure 1 for Large Learning Rate Tames Homogeneity: Convergence and Balancing Effect
Figure 2 for Large Learning Rate Tames Homogeneity: Convergence and Balancing Effect
Figure 3 for Large Learning Rate Tames Homogeneity: Convergence and Balancing Effect
Figure 4 for Large Learning Rate Tames Homogeneity: Convergence and Balancing Effect
Viaarxiv icon

Sqrt(d) Dimension Dependence of Langevin Monte Carlo

Add code
Bookmark button
Alert button
Sep 23, 2021
Ruilin Li, Hongyuan Zha, Molei Tao

Figure 1 for Sqrt(d) Dimension Dependence of Langevin Monte Carlo
Figure 2 for Sqrt(d) Dimension Dependence of Langevin Monte Carlo
Viaarxiv icon

Mean-Square Analysis with An Application to Optimal Dimension Dependence of Langevin Monte Carlo

Add code
Bookmark button
Alert button
Sep 08, 2021
Ruilin Li, Hongyuan Zha, Molei Tao

Figure 1 for Mean-Square Analysis with An Application to Optimal Dimension Dependence of Langevin Monte Carlo
Figure 2 for Mean-Square Analysis with An Application to Optimal Dimension Dependence of Langevin Monte Carlo
Viaarxiv icon

Data-driven Prediction of General Hamiltonian Dynamics via Learning Exactly-Symplectic Maps

Add code
Bookmark button
Alert button
Mar 09, 2021
Renyi Chen, Molei Tao

Figure 1 for Data-driven Prediction of General Hamiltonian Dynamics via Learning Exactly-Symplectic Maps
Figure 2 for Data-driven Prediction of General Hamiltonian Dynamics via Learning Exactly-Symplectic Maps
Figure 3 for Data-driven Prediction of General Hamiltonian Dynamics via Learning Exactly-Symplectic Maps
Figure 4 for Data-driven Prediction of General Hamiltonian Dynamics via Learning Exactly-Symplectic Maps
Viaarxiv icon

Hessian-Free High-Resolution Nesterov Acceleration for Sampling

Add code
Bookmark button
Alert button
Jun 22, 2020
Ruilin Li, Hongyuan Zha, Molei Tao

Figure 1 for Hessian-Free High-Resolution Nesterov Acceleration for Sampling
Figure 2 for Hessian-Free High-Resolution Nesterov Acceleration for Sampling
Figure 3 for Hessian-Free High-Resolution Nesterov Acceleration for Sampling
Figure 4 for Hessian-Free High-Resolution Nesterov Acceleration for Sampling
Viaarxiv icon

Hessian-Free High-Resolution Nesterov Accelerationfor Sampling

Add code
Bookmark button
Alert button
Jun 16, 2020
Ruilin Li, Hongyuan Zha, Molei Tao

Figure 1 for Hessian-Free High-Resolution Nesterov Accelerationfor Sampling
Figure 2 for Hessian-Free High-Resolution Nesterov Accelerationfor Sampling
Figure 3 for Hessian-Free High-Resolution Nesterov Accelerationfor Sampling
Figure 4 for Hessian-Free High-Resolution Nesterov Accelerationfor Sampling
Viaarxiv icon

Improving Sampling Accuracy of Stochastic Gradient MCMC Methods via Non-uniform Subsampling of Gradients

Add code
Bookmark button
Alert button
Feb 20, 2020
Ruilin Li, Xin Wang, Hongyuan Zha, Molei Tao

Figure 1 for Improving Sampling Accuracy of Stochastic Gradient MCMC Methods via Non-uniform Subsampling of Gradients
Figure 2 for Improving Sampling Accuracy of Stochastic Gradient MCMC Methods via Non-uniform Subsampling of Gradients
Figure 3 for Improving Sampling Accuracy of Stochastic Gradient MCMC Methods via Non-uniform Subsampling of Gradients
Figure 4 for Improving Sampling Accuracy of Stochastic Gradient MCMC Methods via Non-uniform Subsampling of Gradients
Viaarxiv icon