Denny Wu

Nonlinear spiked covariance matrices and signal propagation in deep neural networks

Feb 15, 2024
Zhichao Wang, Denny Wu, Zhou Fan

Gradient-Based Feature Learning under Structured Data

Sep 07, 2023
Alireza Mousavi-Hosseini, Denny Wu, Taiji Suzuki, Murat A. Erdogdu

Convergence of mean-field Langevin dynamics: Time and space discretization, stochastic gradient, and variance reduction

Jun 12, 2023
Taiji Suzuki, Denny Wu, Atsushi Nitanda

Primal and Dual Analysis of Entropic Fictitious Play for Finite-sum Problems

Mar 06, 2023
Atsushi Nitanda, Kazusato Oko, Denny Wu, Nobuhito Takenouchi, Taiji Suzuki

High-dimensional Asymptotics of Feature Learning: How One Gradient Step Improves the Representation

May 03, 2022
Jimmy Ba, Murat A. Erdogdu, Taiji Suzuki, Zhichao Wang, Denny Wu, Greg Yang

Convex Analysis of the Mean Field Langevin Dynamics

Jan 25, 2022
Atsushi Nitanda, Denny Wu, Taiji Suzuki

Particle Dual Averaging: Optimization of Mean Field Neural Networks with Global Convergence Rate Analysis

Dec 31, 2020
Atsushi Nitanda, Denny Wu, Taiji Suzuki

When Does Preconditioning Help or Hurt Generalization?

Jul 02, 2020
Shun-ichi Amari, Jimmy Ba, Roger Grosse, Xuechen Li, Atsushi Nitanda, Taiji Suzuki, Denny Wu, Ji Xu

On the Optimal Weighted $\ell_2$ Regularization in Overparameterized Linear Regression

Jun 25, 2020
Denny Wu, Ji Xu
