Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:On the Transferability of Minimal Prediction Preserving Inputs in Question Answering

Sep 17, 2020

Shayne Longpre, Yi Lu, Christopher DuBois

Figure 1 for On the Transferability of Minimal Prediction Preserving Inputs in Question Answering

Figure 2 for On the Transferability of Minimal Prediction Preserving Inputs in Question Answering

Figure 3 for On the Transferability of Minimal Prediction Preserving Inputs in Question Answering

Figure 4 for On the Transferability of Minimal Prediction Preserving Inputs in Question Answering

Share this with someone who'll enjoy it:

Abstract:Recent work (Feng et al., 2018) establishes the presence of short, uninterpretable input fragments that yield high confidence and accuracy in neural models. We refer to these as Minimal Prediction Preserving Inputs (MPPIs). In the context of question answering, we investigate competing hypotheses for the existence of MPPIs, including poor posterior calibration of neural models, lack of pretraining, and "dataset bias" (where a model learns to attend to spurious, non-generalizable cues in the training data). We discover a perplexing invariance of MPPIs to random training seed, model architecture, pretraining, and training domain. MPPIs demonstrate remarkable transferability across domains - closing half the gap between models' performance on comparably short queries and original queries. Additionally, penalizing over-confidence on MPPIs fails to improve either generalization or adversarial robustness. These results suggest the interpretability of MPPIs is insufficient to characterize generalization capacity of these models. We hope this focused investigation encourages a more systematic analysis of model behavior outside of the human interpretable distribution of examples.

View paper on

Share this with someone who'll enjoy it:

Title:On the Transferability of Minimal Prediction Preserving Inputs in Question Answering

Paper and Code