Alert button
Picture for Niru Maheswaranathan

Niru Maheswaranathan

Alert button

Practical tradeoffs between memory, compute, and performance in learned optimizers

Add code
Bookmark button
Alert button
Apr 01, 2022
Luke Metz, C. Daniel Freeman, James Harrison, Niru Maheswaranathan, Jascha Sohl-Dickstein

Figure 1 for Practical tradeoffs between memory, compute, and performance in learned optimizers
Figure 2 for Practical tradeoffs between memory, compute, and performance in learned optimizers
Figure 3 for Practical tradeoffs between memory, compute, and performance in learned optimizers
Figure 4 for Practical tradeoffs between memory, compute, and performance in learned optimizers
Viaarxiv icon

Understanding How Encoder-Decoder Architectures Attend

Add code
Bookmark button
Alert button
Oct 28, 2021
Kyle Aitken, Vinay V Ramasesh, Yuan Cao, Niru Maheswaranathan

Figure 1 for Understanding How Encoder-Decoder Architectures Attend
Figure 2 for Understanding How Encoder-Decoder Architectures Attend
Figure 3 for Understanding How Encoder-Decoder Architectures Attend
Figure 4 for Understanding How Encoder-Decoder Architectures Attend
Viaarxiv icon

Training Learned Optimizers with Randomly Initialized Learned Optimizers

Add code
Bookmark button
Alert button
Jan 14, 2021
Luke Metz, C. Daniel Freeman, Niru Maheswaranathan, Jascha Sohl-Dickstein

Figure 1 for Training Learned Optimizers with Randomly Initialized Learned Optimizers
Figure 2 for Training Learned Optimizers with Randomly Initialized Learned Optimizers
Viaarxiv icon

Reverse engineering learned optimizers reveals known and novel mechanisms

Add code
Bookmark button
Alert button
Nov 04, 2020
Niru Maheswaranathan, David Sussillo, Luke Metz, Ruoxi Sun, Jascha Sohl-Dickstein

Figure 1 for Reverse engineering learned optimizers reveals known and novel mechanisms
Figure 2 for Reverse engineering learned optimizers reveals known and novel mechanisms
Figure 3 for Reverse engineering learned optimizers reveals known and novel mechanisms
Figure 4 for Reverse engineering learned optimizers reveals known and novel mechanisms
Viaarxiv icon

The geometry of integration in text classification RNNs

Add code
Bookmark button
Alert button
Oct 28, 2020
Kyle Aitken, Vinay V. Ramasesh, Ankush Garg, Yuan Cao, David Sussillo, Niru Maheswaranathan

Figure 1 for The geometry of integration in text classification RNNs
Figure 2 for The geometry of integration in text classification RNNs
Figure 3 for The geometry of integration in text classification RNNs
Figure 4 for The geometry of integration in text classification RNNs
Viaarxiv icon

Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves

Add code
Bookmark button
Alert button
Sep 23, 2020
Luke Metz, Niru Maheswaranathan, C. Daniel Freeman, Ben Poole, Jascha Sohl-Dickstein

Figure 1 for Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves
Figure 2 for Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves
Figure 3 for Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves
Figure 4 for Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves
Viaarxiv icon

How recurrent networks implement contextual processing in sentiment analysis

Add code
Bookmark button
Alert button
Apr 17, 2020
Niru Maheswaranathan, David Sussillo

Figure 1 for How recurrent networks implement contextual processing in sentiment analysis
Figure 2 for How recurrent networks implement contextual processing in sentiment analysis
Figure 3 for How recurrent networks implement contextual processing in sentiment analysis
Figure 4 for How recurrent networks implement contextual processing in sentiment analysis
Viaarxiv icon

Using a thousand optimization tasks to learn hyperparameter search strategies

Add code
Bookmark button
Alert button
Mar 11, 2020
Luke Metz, Niru Maheswaranathan, Ruoxi Sun, C. Daniel Freeman, Ben Poole, Jascha Sohl-Dickstein

Figure 1 for Using a thousand optimization tasks to learn hyperparameter search strategies
Figure 2 for Using a thousand optimization tasks to learn hyperparameter search strategies
Figure 3 for Using a thousand optimization tasks to learn hyperparameter search strategies
Figure 4 for Using a thousand optimization tasks to learn hyperparameter search strategies
Viaarxiv icon

From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction

Add code
Bookmark button
Alert button
Dec 12, 2019
Hidenori Tanaka, Aran Nayebi, Niru Maheswaranathan, Lane McIntosh, Stephen A. Baccus, Surya Ganguli

Figure 1 for From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction
Figure 2 for From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction
Figure 3 for From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction
Figure 4 for From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction
Viaarxiv icon