Picture for Frederick Liu

Frederick Liu

Dima

FAVOR#: Sharp Attention Kernel Approximations via New Classes of Positive Random Features

Add code
Feb 01, 2023
Figure 1 for FAVOR#: Sharp Attention Kernel Approximations via New Classes of Positive Random Features
Figure 2 for FAVOR#: Sharp Attention Kernel Approximations via New Classes of Positive Random Features
Figure 3 for FAVOR#: Sharp Attention Kernel Approximations via New Classes of Positive Random Features
Figure 4 for FAVOR#: Sharp Attention Kernel Approximations via New Classes of Positive Random Features
Viaarxiv icon

DETR++: Taming Your Multi-Scale Detection Transformer

Add code
Jun 07, 2022
Figure 1 for DETR++: Taming Your Multi-Scale Detection Transformer
Figure 2 for DETR++: Taming Your Multi-Scale Detection Transformer
Figure 3 for DETR++: Taming Your Multi-Scale Detection Transformer
Viaarxiv icon

Chefs' Random Tables: Non-Trigonometric Random Features

Add code
May 30, 2022
Figure 1 for Chefs' Random Tables: Non-Trigonometric Random Features
Figure 2 for Chefs' Random Tables: Non-Trigonometric Random Features
Figure 3 for Chefs' Random Tables: Non-Trigonometric Random Features
Figure 4 for Chefs' Random Tables: Non-Trigonometric Random Features
Viaarxiv icon

Tracing Knowledge in Language Models Back to the Training Data

Add code
May 24, 2022
Figure 1 for Tracing Knowledge in Language Models Back to the Training Data
Figure 2 for Tracing Knowledge in Language Models Back to the Training Data
Figure 3 for Tracing Knowledge in Language Models Back to the Training Data
Figure 4 for Tracing Knowledge in Language Models Back to the Training Data
Viaarxiv icon

Threading the Needle of On and Off-Manifold Value Functions for Shapley Explanations

Add code
Feb 24, 2022
Figure 1 for Threading the Needle of On and Off-Manifold Value Functions for Shapley Explanations
Figure 2 for Threading the Needle of On and Off-Manifold Value Functions for Shapley Explanations
Figure 3 for Threading the Needle of On and Off-Manifold Value Functions for Shapley Explanations
Figure 4 for Threading the Needle of On and Off-Manifold Value Functions for Shapley Explanations
Viaarxiv icon

First is Better Than Last for Training Data Influence

Add code
Feb 24, 2022
Figure 1 for First is Better Than Last for Training Data Influence
Figure 2 for First is Better Than Last for Training Data Influence
Figure 3 for First is Better Than Last for Training Data Influence
Figure 4 for First is Better Than Last for Training Data Influence
Viaarxiv icon

EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks

Add code
Oct 16, 2021
Figure 1 for EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks
Figure 2 for EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks
Figure 3 for EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks
Viaarxiv icon

Leveraging redundancy in attention with Reuse Transformers

Add code
Oct 13, 2021
Figure 1 for Leveraging redundancy in attention with Reuse Transformers
Figure 2 for Leveraging redundancy in attention with Reuse Transformers
Figure 3 for Leveraging redundancy in attention with Reuse Transformers
Figure 4 for Leveraging redundancy in attention with Reuse Transformers
Viaarxiv icon

Detecting Errors and Estimating Accuracy on Unlabeled Data with Self-training Ensembles

Add code
Jun 29, 2021
Figure 1 for Detecting Errors and Estimating Accuracy on Unlabeled Data with Self-training Ensembles
Figure 2 for Detecting Errors and Estimating Accuracy on Unlabeled Data with Self-training Ensembles
Figure 3 for Detecting Errors and Estimating Accuracy on Unlabeled Data with Self-training Ensembles
Figure 4 for Detecting Errors and Estimating Accuracy on Unlabeled Data with Self-training Ensembles
Viaarxiv icon

The Penalty Imposed by Ablated Data Augmentation

Add code
Jun 08, 2020
Figure 1 for The Penalty Imposed by Ablated Data Augmentation
Figure 2 for The Penalty Imposed by Ablated Data Augmentation
Figure 3 for The Penalty Imposed by Ablated Data Augmentation
Figure 4 for The Penalty Imposed by Ablated Data Augmentation
Viaarxiv icon