Picture for Anna Rumshisky

Anna Rumshisky

An Efficient DP-SGD Mechanism for Large Scale NLP Models

Add code
Jul 14, 2021
Figure 1 for An Efficient DP-SGD Mechanism for Large Scale NLP Models
Figure 2 for An Efficient DP-SGD Mechanism for Large Scale NLP Models
Viaarxiv icon

BERT Busters: Outlier Dimensions that Disrupt Transformers

Add code
Jun 02, 2021
Figure 1 for BERT Busters: Outlier Dimensions that Disrupt Transformers
Figure 2 for BERT Busters: Outlier Dimensions that Disrupt Transformers
Figure 3 for BERT Busters: Outlier Dimensions that Disrupt Transformers
Figure 4 for BERT Busters: Outlier Dimensions that Disrupt Transformers
Viaarxiv icon

Continual Learning for Neural Semantic Parsing

Add code
Oct 15, 2020
Figure 1 for Continual Learning for Neural Semantic Parsing
Figure 2 for Continual Learning for Neural Semantic Parsing
Figure 3 for Continual Learning for Neural Semantic Parsing
Figure 4 for Continual Learning for Neural Semantic Parsing
Viaarxiv icon

When BERT Plays the Lottery, All Tickets Are Winning

Add code
May 01, 2020
Figure 1 for When BERT Plays the Lottery, All Tickets Are Winning
Figure 2 for When BERT Plays the Lottery, All Tickets Are Winning
Figure 3 for When BERT Plays the Lottery, All Tickets Are Winning
Figure 4 for When BERT Plays the Lottery, All Tickets Are Winning
Viaarxiv icon

A Primer in BERTology: What we know about how BERT works

Add code
Feb 27, 2020
Figure 1 for A Primer in BERTology: What we know about how BERT works
Figure 2 for A Primer in BERTology: What we know about how BERT works
Figure 3 for A Primer in BERTology: What we know about how BERT works
Figure 4 for A Primer in BERTology: What we know about how BERT works
Viaarxiv icon

Memory-Augmented Recurrent Networks for Dialogue Coherence

Add code
Oct 16, 2019
Figure 1 for Memory-Augmented Recurrent Networks for Dialogue Coherence
Figure 2 for Memory-Augmented Recurrent Networks for Dialogue Coherence
Figure 3 for Memory-Augmented Recurrent Networks for Dialogue Coherence
Figure 4 for Memory-Augmented Recurrent Networks for Dialogue Coherence
Viaarxiv icon

Injecting Hierarchy with U-Net Transformers

Add code
Oct 16, 2019
Figure 1 for Injecting Hierarchy with U-Net Transformers
Figure 2 for Injecting Hierarchy with U-Net Transformers
Figure 3 for Injecting Hierarchy with U-Net Transformers
Figure 4 for Injecting Hierarchy with U-Net Transformers
Viaarxiv icon

Revealing the Dark Secrets of BERT

Add code
Sep 11, 2019
Figure 1 for Revealing the Dark Secrets of BERT
Figure 2 for Revealing the Dark Secrets of BERT
Figure 3 for Revealing the Dark Secrets of BERT
Figure 4 for Revealing the Dark Secrets of BERT
Viaarxiv icon

NarrativeTime: Dense High-Speed Temporal Annotation on a Timeline

Add code
Aug 29, 2019
Figure 1 for NarrativeTime: Dense High-Speed Temporal Annotation on a Timeline
Figure 2 for NarrativeTime: Dense High-Speed Temporal Annotation on a Timeline
Figure 3 for NarrativeTime: Dense High-Speed Temporal Annotation on a Timeline
Figure 4 for NarrativeTime: Dense High-Speed Temporal Annotation on a Timeline
Viaarxiv icon

Solving Math Word Problems with Double-Decoder Transformer

Add code
Aug 28, 2019
Figure 1 for Solving Math Word Problems with Double-Decoder Transformer
Figure 2 for Solving Math Word Problems with Double-Decoder Transformer
Viaarxiv icon