Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Adam Poliak

What do you learn from context? Probing for sentence structure in contextualized word representations

May 15, 2019

Ian Tenney, Patrick Xia, Berlin Chen, Alex Wang, Adam Poliak, R Thomas McCoy, Najoung Kim, Benjamin Van Durme, Samuel R. Bowman, Dipanjan Das(+1 more)

Figure 1 for What do you learn from context? Probing for sentence structure in contextualized word representations

Figure 2 for What do you learn from context? Probing for sentence structure in contextualized word representations

Figure 3 for What do you learn from context? Probing for sentence structure in contextualized word representations

Figure 4 for What do you learn from context? Probing for sentence structure in contextualized word representations

Abstract:Contextualized representation models such as ELMo (Peters et al., 2018a) and BERT (Devlin et al., 2018) have recently achieved state-of-the-art results on a diverse array of downstream NLP tasks. Building on recent token-level probing work, we introduce a novel edge probing task design and construct a broad suite of sub-sentence tasks derived from the traditional structured NLP pipeline. We probe word-level contextual representations from four recent models and investigate how they encode sentence structure across a range of syntactic, semantic, local, and long-range phenomena. We find that existing models trained on language modeling and translation produce strong representations for syntactic phenomena, but only offer comparably small improvements on semantic tasks over a non-contextual baseline.

* ICLR 2019 camera-ready version, 17 pages including appendices

Via

Access Paper or Ask Questions

Probing What Different NLP Tasks Teach Machines about Function Word Comprehension

Apr 25, 2019

Najoung Kim, Roma Patel, Adam Poliak, Alex Wang, Patrick Xia, R. Thomas McCoy, Ian Tenney, Alexis Ross, Tal Linzen, Benjamin Van Durme(+2 more)

Figure 1 for Probing What Different NLP Tasks Teach Machines about Function Word Comprehension

Figure 2 for Probing What Different NLP Tasks Teach Machines about Function Word Comprehension

Figure 3 for Probing What Different NLP Tasks Teach Machines about Function Word Comprehension

Figure 4 for Probing What Different NLP Tasks Teach Machines about Function Word Comprehension

Abstract:We introduce a set of nine challenge tasks that test for the understanding of function words. These tasks are created by structurally mutating sentences from existing datasets to target the comprehension of specific types of function words (e.g., prepositions, wh-words). Using these probing tasks, we explore the effects of various pretraining objectives for sentence encoders (e.g., language modeling, CCG supertagging and natural language inference (NLI)) on the learned representations. Our results show that pretraining on CCG---our most syntactic objective---performs the best on average across our probing tasks, suggesting that syntactic knowledge helps function word comprehension. Language modeling also shows strong performance, supporting its widespread use for pretraining state-of-the-art NLP models. Overall, no pretraining objective dominates across the board, and our function word probing tasks highlight several intuitive differences between pretraining objectives, e.g., that NLI helps the comprehension of negation.

* Accepted to *SEM 2019

Via

Access Paper or Ask Questions

Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation

Aug 29, 2018

Adam Poliak, Aparajita Haldar, Rachel Rudinger, J. Edward Hu, Ellie Pavlick, Aaron Steven White, Benjamin Van Durme

Figure 1 for Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation

Figure 2 for Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation

Figure 3 for Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation

Figure 4 for Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation

Abstract:We present a large-scale collection of diverse natural language inference (NLI) datasets that help provide insight into how well a sentence representation captures distinct types of reasoning. The collection results from recasting 13 existing datasets from 7 semantic phenomena into a common NLI structure, resulting in over half a million labeled context-hypothesis pairs in total. We refer to our collection as the DNC: Diverse Natural Language Inference Collection. The DNC is available online at https://www.decomp.net, and will grow over time as additional resources are recast and added from novel sources.

* To be presented at EMNLP 2018. 15 pages

Via

Access Paper or Ask Questions

On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference

May 06, 2018

Adam Poliak, Yonatan Belinkov, James Glass, Benjamin Van Durme

Figure 1 for On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference

Figure 2 for On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference

Figure 3 for On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference

Figure 4 for On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference

Abstract:We propose a process for investigating the extent to which sentence representations arising from neural machine translation (NMT) systems encode distinct semantic phenomena. We use these representations as features to train a natural language inference (NLI) classifier based on datasets recast from existing semantic annotations. In applying this process to a representative NMT system, we find its encoder appears most suited to supporting inferences at the syntax-semantics interface, as compared to anaphora resolution requiring world-knowledge. We conclude with a discussion on the merits and potential deficiencies of the existing process, and how it may be improved and extended as a broader framework for evaluating semantic coverage.

* To be presented at NAACL 2018 - 11 pages

Via

Access Paper or Ask Questions

Hypothesis Only Baselines in Natural Language Inference

May 02, 2018

Adam Poliak, Jason Naradowsky, Aparajita Haldar, Rachel Rudinger, Benjamin Van Durme

Figure 1 for Hypothesis Only Baselines in Natural Language Inference

Figure 2 for Hypothesis Only Baselines in Natural Language Inference

Figure 3 for Hypothesis Only Baselines in Natural Language Inference

Figure 4 for Hypothesis Only Baselines in Natural Language Inference

Abstract:We propose a hypothesis only baseline for diagnosing Natural Language Inference (NLI). Especially when an NLI dataset assumes inference is occurring based purely on the relationship between a context and a hypothesis, it follows that assessing entailment relations while ignoring the provided context is a degenerate solution. Yet, through experiments on ten distinct NLI datasets, we find that this approach, which we refer to as a hypothesis-only model, is able to significantly outperform a majority class baseline across a number of NLI datasets. Our analysis suggests that statistical irregularities may allow a model to perform NLI in some datasets beyond what should be achievable without access to the context.

* Accepted at *SEM 2018 as long paper. 12 pages

Via

Access Paper or Ask Questions

Frame-Based Continuous Lexical Semantics through Exponential Family Tensor Factorization and Semantic Proto-Roles

Jun 29, 2017

Francis Ferraro, Adam Poliak, Ryan Cotterell, Benjamin Van Durme

Figure 1 for Frame-Based Continuous Lexical Semantics through Exponential Family Tensor Factorization and Semantic Proto-Roles

Figure 2 for Frame-Based Continuous Lexical Semantics through Exponential Family Tensor Factorization and Semantic Proto-Roles

Figure 3 for Frame-Based Continuous Lexical Semantics through Exponential Family Tensor Factorization and Semantic Proto-Roles

Abstract:We study how different frame annotations complement one another when learning continuous lexical semantics. We learn the representations from a tensorized skip-gram model that consistently encodes syntactic-semantic content better, with multiple 10% gains over baselines.

* Accepted at the Sixth Joint Conference on Lexical and Computational Semantics (*SEM). Association for Computational Linguistics, Vancouver, Canada. 2017

Via

Access Paper or Ask Questions