Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Benjamin Hoover

LMdiff: A Visual Diff Tool to Compare Language Models

Nov 02, 2021

Hendrik Strobelt, Benjamin Hoover, Arvind Satyanarayan, Sebastian Gehrmann

Figure 1 for LMdiff: A Visual Diff Tool to Compare Language Models

Figure 2 for LMdiff: A Visual Diff Tool to Compare Language Models

Figure 3 for LMdiff: A Visual Diff Tool to Compare Language Models

Figure 4 for LMdiff: A Visual Diff Tool to Compare Language Models

Abstract:While different language models are ubiquitous in NLP, it is hard to contrast their outputs and identify which contexts one can handle better than the other. To address this question, we introduce LMdiff, a tool that visually compares probability distributions of two models that differ, e.g., through finetuning, distillation, or simply training with different parameter sizes. LMdiff allows the generation of hypotheses about model behavior by investigating text instances token by token and further assists in choosing these interesting text instances by identifying the most interesting phrases from large corpora. We showcase the applicability of LMdiff for hypothesis generation across multiple case studies. A demo is available at http://lmdiff.net .

* EMNLP 2021 Demo Paper

Via

Access Paper or Ask Questions

Shared Interest: Large-Scale Visual Analysis of Model Behavior by Measuring Human-AI Alignment

Jul 20, 2021

Angie Boggust, Benjamin Hoover, Arvind Satyanarayan, Hendrik Strobelt

Figure 1 for Shared Interest: Large-Scale Visual Analysis of Model Behavior by Measuring Human-AI Alignment

Figure 2 for Shared Interest: Large-Scale Visual Analysis of Model Behavior by Measuring Human-AI Alignment

Figure 3 for Shared Interest: Large-Scale Visual Analysis of Model Behavior by Measuring Human-AI Alignment

Abstract:Saliency methods -- techniques to identify the importance of input features on a model's output -- are a common first step in understanding neural network behavior. However, interpreting saliency requires tedious manual inspection to identify and aggregate patterns in model behavior, resulting in ad hoc or cherry-picked analysis. To address these concerns, we present Shared Interest: a set of metrics for comparing saliency with human annotated ground truths. By providing quantitative descriptors, Shared Interest allows ranking, sorting, and aggregation of inputs thereby facilitating large-scale systematic analysis of model behavior. We use Shared Interest to identify eight recurring patterns in model behavior including focusing on a sufficient subset of ground truth features or being distracted by contextual features. Working with representative real-world users, we show how Shared Interest can be used to rapidly develop or lose trust in a model's reliability, uncover issues that are missed in manual analyses, and enable interactive probing of model behavior.

* 14 pages, 8 figures. For more details, see http://shared-interest.csail.mit.edu

Via

Access Paper or Ask Questions

FairyTailor: A Multimodal Generative Framework for Storytelling

Jul 13, 2021

Eden Bensaid, Mauro Martino, Benjamin Hoover, Jacob Andreas, Hendrik Strobelt

Figure 1 for FairyTailor: A Multimodal Generative Framework for Storytelling

Figure 2 for FairyTailor: A Multimodal Generative Framework for Storytelling

Figure 3 for FairyTailor: A Multimodal Generative Framework for Storytelling

Figure 4 for FairyTailor: A Multimodal Generative Framework for Storytelling

Abstract:Storytelling is an open-ended task that entails creative thinking and requires a constant flow of ideas. Natural language generation (NLG) for storytelling is especially challenging because it requires the generated text to follow an overall theme while remaining creative and diverse to engage the reader. In this work, we introduce a system and a web-based demo, FairyTailor, for human-in-the-loop visual story co-creation. Users can create a cohesive children's fairytale by weaving generated texts and retrieved images with their input. FairyTailor adds another modality and modifies the text generation process to produce a coherent and creative sequence of text and images. To our knowledge, this is the first dynamic tool for multimodal story generation that allows interactive co-formation of both texts and images. It allows users to give feedback on co-created stories and share their results.

* visit https://fairytailor.org/ and https://github.com/EdenBD/MultiModalStory-demo for web demo and source code

Via

Access Paper or Ask Questions

Can a Fruit Fly Learn Word Embeddings?

Jan 18, 2021

Yuchen Liang, Chaitanya K. Ryali, Benjamin Hoover, Leopold Grinberg, Saket Navlakha, Mohammed J. Zaki, Dmitry Krotov

Figure 1 for Can a Fruit Fly Learn Word Embeddings?

Figure 2 for Can a Fruit Fly Learn Word Embeddings?

Figure 3 for Can a Fruit Fly Learn Word Embeddings?

Figure 4 for Can a Fruit Fly Learn Word Embeddings?

Abstract:The mushroom body of the fruit fly brain is one of the best studied systems in neuroscience. At its core it consists of a population of Kenyon cells, which receive inputs from multiple sensory modalities. These cells are inhibited by the anterior paired lateral neuron, thus creating a sparse high dimensional representation of the inputs. In this work we study a mathematical formalization of this network motif and apply it to learning the correlational structure between words and their context in a corpus of unstructured text, a common natural language processing (NLP) task. We show that this network can learn semantic representations of words and can generate both static and context-dependent word embeddings. Unlike conventional methods (e.g., BERT, GloVe) that use dense representations for word embedding, our algorithm encodes semantic meaning of words and their context in the form of sparse binary hash codes. The quality of the learned representations is evaluated on word similarity analysis, word-sense disambiguation, and document classification. It is shown that not only can the fruit fly network motif achieve performance comparable to existing methods in NLP, but, additionally, it uses only a fraction of the computational resources (shorter training time and smaller memory footprint).

* Accepted for publication at ICLR 2021

Via

Access Paper or Ask Questions

exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models

Oct 11, 2019

Benjamin Hoover, Hendrik Strobelt, Sebastian Gehrmann

Figure 1 for exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models

Figure 2 for exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models

Figure 3 for exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models

Figure 4 for exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models

Abstract:Large language models can produce powerful contextual representations that lead to improvements across many NLP tasks. Since these models are typically guided by a sequence of learned self attention mechanisms and may comprise undesired inductive biases, it is paramount to be able to explore what the attention has learned. While static analyses of these models lead to targeted insights, interactive tools are more dynamic and can help humans better gain an intuition for the model-internal reasoning process. We present exBERT, an interactive tool named after the popular BERT language model, that provides insights into the meaning of the contextual representations by matching a human-specified input to similar contexts in a large annotated dataset. By aggregating the annotations of the matching similar contexts, exBERT helps intuitively explain what each attention-head has learned.

Via

Access Paper or Ask Questions