Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:How Context Affects Language Models' Factual Predictions

May 10, 2020

Fabio Petroni, Patrick Lewis, Aleksandra Piktus, Tim Rocktäschel, Yuxiang Wu, Alexander H. Miller, Sebastian Riedel

Figure 1 for How Context Affects Language Models' Factual Predictions

Figure 2 for How Context Affects Language Models' Factual Predictions

Figure 3 for How Context Affects Language Models' Factual Predictions

Figure 4 for How Context Affects Language Models' Factual Predictions

Share this with someone who'll enjoy it:

Abstract:When pre-trained on large unsupervised textual corpora, language models are able to store and retrieve factual knowledge to some extent, making it possible to use them directly for zero-shot cloze-style question answering. However, storing factual knowledge in a fixed number of weights of a language model clearly has limitations. Previous approaches have successfully provided access to information outside the model weights using supervised architectures that combine an information retrieval system with a machine reading component. In this paper, we go a step further and integrate information from a retrieval system with a pre-trained language model in a purely unsupervised way. We report that augmenting pre-trained language models in this way dramatically improves performance and that the resulting system, despite being unsupervised, is competitive with a supervised machine reading baseline. Furthermore, processing query and context with different segment tokens allows BERT to utilize its Next Sentence Prediction pre-trained classifier to determine whether the context is relevant or not, substantially improving BERT's zero-shot cloze-style question-answering performance and making its predictions robust to noisy contexts.

* accepted at AKBC 2020

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:How Context Affects Language Models' Factual Predictions

Paper and Code