Picture for Niru Maheswaranathan

Niru Maheswaranathan

Practical tradeoffs between memory, compute, and performance in learned optimizers

Add code
Apr 01, 2022
Figure 1 for Practical tradeoffs between memory, compute, and performance in learned optimizers
Figure 2 for Practical tradeoffs between memory, compute, and performance in learned optimizers
Figure 3 for Practical tradeoffs between memory, compute, and performance in learned optimizers
Figure 4 for Practical tradeoffs between memory, compute, and performance in learned optimizers
Viaarxiv icon

Understanding How Encoder-Decoder Architectures Attend

Add code
Oct 28, 2021
Figure 1 for Understanding How Encoder-Decoder Architectures Attend
Figure 2 for Understanding How Encoder-Decoder Architectures Attend
Figure 3 for Understanding How Encoder-Decoder Architectures Attend
Figure 4 for Understanding How Encoder-Decoder Architectures Attend
Viaarxiv icon

Training Learned Optimizers with Randomly Initialized Learned Optimizers

Add code
Jan 14, 2021
Figure 1 for Training Learned Optimizers with Randomly Initialized Learned Optimizers
Figure 2 for Training Learned Optimizers with Randomly Initialized Learned Optimizers
Viaarxiv icon

Reverse engineering learned optimizers reveals known and novel mechanisms

Add code
Nov 04, 2020
Figure 1 for Reverse engineering learned optimizers reveals known and novel mechanisms
Figure 2 for Reverse engineering learned optimizers reveals known and novel mechanisms
Figure 3 for Reverse engineering learned optimizers reveals known and novel mechanisms
Figure 4 for Reverse engineering learned optimizers reveals known and novel mechanisms
Viaarxiv icon

The geometry of integration in text classification RNNs

Add code
Oct 28, 2020
Figure 1 for The geometry of integration in text classification RNNs
Figure 2 for The geometry of integration in text classification RNNs
Figure 3 for The geometry of integration in text classification RNNs
Figure 4 for The geometry of integration in text classification RNNs
Viaarxiv icon

Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves

Add code
Sep 23, 2020
Figure 1 for Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves
Figure 2 for Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves
Figure 3 for Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves
Figure 4 for Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves
Viaarxiv icon

How recurrent networks implement contextual processing in sentiment analysis

Add code
Apr 17, 2020
Figure 1 for How recurrent networks implement contextual processing in sentiment analysis
Figure 2 for How recurrent networks implement contextual processing in sentiment analysis
Figure 3 for How recurrent networks implement contextual processing in sentiment analysis
Figure 4 for How recurrent networks implement contextual processing in sentiment analysis
Viaarxiv icon

Using a thousand optimization tasks to learn hyperparameter search strategies

Add code
Mar 11, 2020
Figure 1 for Using a thousand optimization tasks to learn hyperparameter search strategies
Figure 2 for Using a thousand optimization tasks to learn hyperparameter search strategies
Figure 3 for Using a thousand optimization tasks to learn hyperparameter search strategies
Figure 4 for Using a thousand optimization tasks to learn hyperparameter search strategies
Viaarxiv icon

From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction

Add code
Dec 12, 2019
Figure 1 for From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction
Figure 2 for From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction
Figure 3 for From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction
Figure 4 for From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction
Viaarxiv icon

Universality and individuality in neural dynamics across large populations of recurrent networks

Add code
Jul 19, 2019
Figure 1 for Universality and individuality in neural dynamics across large populations of recurrent networks
Figure 2 for Universality and individuality in neural dynamics across large populations of recurrent networks
Figure 3 for Universality and individuality in neural dynamics across large populations of recurrent networks
Figure 4 for Universality and individuality in neural dynamics across large populations of recurrent networks
Viaarxiv icon