Picture for Ankit Gupta

Ankit Gupta

Towards Better Guided Attention and Human Knowledge Insertion in Deep Convolutional Neural Networks

Add code
Oct 20, 2022
Figure 1 for Towards Better Guided Attention and Human Knowledge Insertion in Deep Convolutional Neural Networks
Figure 2 for Towards Better Guided Attention and Human Knowledge Insertion in Deep Convolutional Neural Networks
Figure 3 for Towards Better Guided Attention and Human Knowledge Insertion in Deep Convolutional Neural Networks
Figure 4 for Towards Better Guided Attention and Human Knowledge Insertion in Deep Convolutional Neural Networks
Viaarxiv icon

Analyzing Transformers in Embedding Space

Add code
Sep 06, 2022
Figure 1 for Analyzing Transformers in Embedding Space
Figure 2 for Analyzing Transformers in Embedding Space
Figure 3 for Analyzing Transformers in Embedding Space
Figure 4 for Analyzing Transformers in Embedding Space
Viaarxiv icon

Long Range Language Modeling via Gated State Spaces

Add code
Jul 02, 2022
Figure 1 for Long Range Language Modeling via Gated State Spaces
Figure 2 for Long Range Language Modeling via Gated State Spaces
Figure 3 for Long Range Language Modeling via Gated State Spaces
Viaarxiv icon

On the Parameterization and Initialization of Diagonal State Space Models

Add code
Jun 23, 2022
Figure 1 for On the Parameterization and Initialization of Diagonal State Space Models
Figure 2 for On the Parameterization and Initialization of Diagonal State Space Models
Figure 3 for On the Parameterization and Initialization of Diagonal State Space Models
Figure 4 for On the Parameterization and Initialization of Diagonal State Space Models
Viaarxiv icon

Diagonal State Spaces are as Effective as Structured State Spaces

Add code
Mar 27, 2022
Figure 1 for Diagonal State Spaces are as Effective as Structured State Spaces
Figure 2 for Diagonal State Spaces are as Effective as Structured State Spaces
Figure 3 for Diagonal State Spaces are as Effective as Structured State Spaces
Viaarxiv icon

Machine Learning-based Urban Canyon Path Loss Prediction using 28 GHz Manhattan Measurements

Add code
Feb 10, 2022
Figure 1 for Machine Learning-based Urban Canyon Path Loss Prediction using 28 GHz Manhattan Measurements
Figure 2 for Machine Learning-based Urban Canyon Path Loss Prediction using 28 GHz Manhattan Measurements
Figure 3 for Machine Learning-based Urban Canyon Path Loss Prediction using 28 GHz Manhattan Measurements
Figure 4 for Machine Learning-based Urban Canyon Path Loss Prediction using 28 GHz Manhattan Measurements
Viaarxiv icon

SCROLLS: Standardized CompaRison Over Long Language Sequences

Add code
Jan 10, 2022
Figure 1 for SCROLLS: Standardized CompaRison Over Long Language Sequences
Figure 2 for SCROLLS: Standardized CompaRison Over Long Language Sequences
Figure 3 for SCROLLS: Standardized CompaRison Over Long Language Sequences
Figure 4 for SCROLLS: Standardized CompaRison Over Long Language Sequences
Viaarxiv icon

Memory-efficient Transformers via Top-$k$ Attention

Add code
Jun 13, 2021
Figure 1 for Memory-efficient Transformers via Top-$k$ Attention
Figure 2 for Memory-efficient Transformers via Top-$k$ Attention
Figure 3 for Memory-efficient Transformers via Top-$k$ Attention
Figure 4 for Memory-efficient Transformers via Top-$k$ Attention
Viaarxiv icon

Value-aware Approximate Attention

Add code
Mar 17, 2021
Figure 1 for Value-aware Approximate Attention
Figure 2 for Value-aware Approximate Attention
Figure 3 for Value-aware Approximate Attention
Figure 4 for Value-aware Approximate Attention
Viaarxiv icon

DART: Open-Domain Structured Data Record to Text Generation

Add code
Jul 06, 2020
Figure 1 for DART: Open-Domain Structured Data Record to Text Generation
Figure 2 for DART: Open-Domain Structured Data Record to Text Generation
Figure 3 for DART: Open-Domain Structured Data Record to Text Generation
Figure 4 for DART: Open-Domain Structured Data Record to Text Generation
Viaarxiv icon