Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jonathan K. Kummerfeld

Exploring the Value of Personalized Word Embeddings

Nov 11, 2020
Charles Welch, Jonathan K. Kummerfeld, Verónica Pérez-Rosas, Rada Mihalcea

Figure 1 for Exploring the Value of Personalized Word Embeddings

Figure 2 for Exploring the Value of Personalized Word Embeddings

Figure 3 for Exploring the Value of Personalized Word Embeddings

Figure 4 for Exploring the Value of Personalized Word Embeddings

In this paper, we introduce personalized word embeddings, and examine their value for language modeling. We compare the performance of our proposed prediction model when using personalized versus generic word representations, and study how these representations can be leveraged for improved performance. We provide insight into what types of words can be more accurately predicted when building personalized models. Our results show that a subset of words belonging to specific psycholinguistic categories tend to vary more in their representations across users and that combining generic and personalized word embeddings yields the best performance, with a 4.7% relative reduction in perplexity. Additionally, we show that a language model using personalized word embeddings can be effectively used for authorship attribution.

* COLING 2020

Via

Access Paper or Ask Questions

Compositional Demographic Word Embeddings

Oct 29, 2020
Charles Welch, Jonathan K. Kummerfeld, Verónica Pérez-Rosas, Rada Mihalcea

Figure 1 for Compositional Demographic Word Embeddings

Figure 2 for Compositional Demographic Word Embeddings

Figure 3 for Compositional Demographic Word Embeddings

Figure 4 for Compositional Demographic Word Embeddings

Word embeddings are usually derived from corpora containing text from many individuals, thus leading to general purpose representations rather than individually personalized representations. While personalized embeddings can be useful to improve language model performance and other language processing tasks, they can only be computed for people with a large amount of longitudinal data, which is not the case for new users. We propose a new form of personalized word embeddings that use demographic-specific word representations derived compositionally from full or partial demographic information for a user (i.e., gender, age, location, religion). We show that the resulting demographic-aware word representations outperform generic word representations on two tasks for English: language modeling and word associations. We further explore the trade-off between the number of available attributes and their relative effectiveness and discuss the ethical implications of using them.

* To appear at EMNLP 2020

Via

Access Paper or Ask Questions

Improving Low Compute Language Modeling with In-Domain Embedding Initialisation

Sep 30, 2020
Charles Welch, Rada Mihalcea, Jonathan K. Kummerfeld

Figure 1 for Improving Low Compute Language Modeling with In-Domain Embedding Initialisation

Figure 2 for Improving Low Compute Language Modeling with In-Domain Embedding Initialisation

Figure 3 for Improving Low Compute Language Modeling with In-Domain Embedding Initialisation

Figure 4 for Improving Low Compute Language Modeling with In-Domain Embedding Initialisation

Many NLP applications, such as biomedical data and technical support, have 10-100 million tokens of in-domain data and limited computational resources for learning from it. How should we train a language model in this scenario? Most language modeling research considers either a small dataset with a closed vocabulary (like the standard 1 million token Penn Treebank), or the whole web with byte-pair encoding. We show that for our target setting in English, initialising and freezing input embeddings using in-domain data can improve language model performance by providing a useful representation of rare words, and this pattern holds across several different domains. In the process, we show that the standard convention of tying input and output embeddings does not improve perplexity when initializing with embeddings trained on in-domain data.

* To appear at EMNLP 2020

Via

Access Paper or Ask Questions

Analyzing the Surprising Variability in Word Embedding Stability Across Languages

Apr 30, 2020
Laura Burdick, Jonathan K. Kummerfeld, Rada Mihalcea

Figure 1 for Analyzing the Surprising Variability in Word Embedding Stability Across Languages

Figure 2 for Analyzing the Surprising Variability in Word Embedding Stability Across Languages

Figure 3 for Analyzing the Surprising Variability in Word Embedding Stability Across Languages

Figure 4 for Analyzing the Surprising Variability in Word Embedding Stability Across Languages

Word embeddings are powerful representations that form the foundation of many natural language processing architectures and tasks, both in English and in other languages. To gain further insight into word embeddings in multiple languages, we explore their stability, defined as the overlap between the nearest neighbors of a word in different embedding spaces. We discuss linguistic properties that are related to stability, drawing out insights about how morphological and other features relate to stability. This has implications for the usage of embeddings, particularly in research that uses embeddings to study language trends.

Via

Access Paper or Ask Questions

The Eighth Dialog System Technology Challenge

Nov 14, 2019
Seokhwan Kim, Michel Galley, Chulaka Gunasekara, Sungjin Lee, Adam Atkinson, Baolin Peng, Hannes Schulz, Jianfeng Gao, Jinchao Li, Mahmoud Adada, Minlie Huang, Luis Lastras, Jonathan K. Kummerfeld, Walter S. Lasecki, Chiori Hori, Anoop Cherian, Tim K. Marks, Abhinav Rastogi, Xiaoxue Zang, Srinivas Sunkara, Raghav Gupta

Figure 1 for The Eighth Dialog System Technology Challenge

Figure 2 for The Eighth Dialog System Technology Challenge

Figure 3 for The Eighth Dialog System Technology Challenge

Figure 4 for The Eighth Dialog System Technology Challenge

This paper introduces the Eighth Dialog System Technology Challenge. In line with recent challenges, the eighth edition focuses on applying end-to-end dialog technologies in a pragmatic way for multi-domain task-completion, noetic response selection, audio visual scene-aware dialog, and schema-guided dialog state tracking tasks. This paper describes the task definition, provided datasets, and evaluation set-up for each track. We also summarize the results of the submitted systems to highlight the overall trends of the state-of-the-art technologies for the tasks.

* Submitted to NeurIPS 2019 3rd Conversational AI Workshop

Via

Access Paper or Ask Questions

No Press Diplomacy: Modeling Multi-Agent Gameplay

Sep 04, 2019
Philip Paquette, Yuchen Lu, Steven Bocco, Max O. Smith, Satya Ortiz-Gagne, Jonathan K. Kummerfeld, Satinder Singh, Joelle Pineau, Aaron Courville

Figure 1 for No Press Diplomacy: Modeling Multi-Agent Gameplay

Figure 2 for No Press Diplomacy: Modeling Multi-Agent Gameplay

Figure 3 for No Press Diplomacy: Modeling Multi-Agent Gameplay

Figure 4 for No Press Diplomacy: Modeling Multi-Agent Gameplay

Diplomacy is a seven-player non-stochastic, non-cooperative game, where agents acquire resources through a mix of teamwork and betrayal. Reliance on trust and coordination makes Diplomacy the first non-cooperative multi-agent benchmark for complex sequential social dilemmas in a rich environment. In this work, we focus on training an agent that learns to play the No Press version of Diplomacy where there is no dedicated communication channel between players. We present DipNet, a neural-network-based policy model for No Press Diplomacy. The model was trained on a new dataset of more than 150,000 human games. Our model is trained by supervised learning (SL) from expert trajectories, which is then used to initialize a reinforcement learning (RL) agent trained through self-play. Both the SL and RL agents demonstrate state-of-the-art No Press performance by beating popular rule-based bots.

* Accepted at NeurIPS 2019

Via

Access Paper or Ask Questions

An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction

Sep 04, 2019
Stefan Larson, Anish Mahendran, Joseph J. Peper, Christopher Clarke, Andrew Lee, Parker Hill, Jonathan K. Kummerfeld, Kevin Leach, Michael A. Laurenzano, Lingjia Tang, Jason Mars

Figure 1 for An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction

Figure 2 for An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction

Figure 3 for An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction

Figure 4 for An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction

Task-oriented dialog systems need to know when a query falls outside their range of supported intents, but current text classification corpora only define label sets that cover every example. We introduce a new dataset that includes queries that are out-of-scope---i.e., queries that do not fall into any of the system's supported intents. This poses a new challenge because models cannot assume that every query at inference time belongs to a system-supported intent class. Our dataset also covers 150 intent classes over 10 domains, capturing the breadth that a production task-oriented agent must handle. We evaluate a range of benchmark classifiers on our dataset along with several different out-of-scope identification schemes. We find that while the classifiers perform well on in-scope intent classification, they struggle to identify out-of-scope queries. Our dataset and evaluation fill an important gap in the field, offering a way of more rigorously and realistically benchmarking text classification in task-driven dialog systems.

* Accepted to EMNLP-IJCNLP 2019

Via

Access Paper or Ask Questions

SLATE: A Super-Lightweight Annotation Tool for Experts

Jul 18, 2019
Jonathan K. Kummerfeld

Figure 1 for SLATE: A Super-Lightweight Annotation Tool for Experts

Figure 2 for SLATE: A Super-Lightweight Annotation Tool for Experts

Figure 3 for SLATE: A Super-Lightweight Annotation Tool for Experts

Many annotation tools have been developed, covering a wide variety of tasks and providing features like user management, pre-processing, and automatic labeling. However, all of these tools use Graphical User Interfaces, and often require substantial effort to install and configure. This paper presents a new annotation tool that is designed to fill the niche of a lightweight interface for users with a terminal-based workflow. Slate supports annotation at different scales (spans of characters, tokens, and lines, or a document) and of different types (free text, labels, and links), with easily customisable keybindings, and unicode support. In a user study comparing with other tools it was consistently the easiest to install and use. Slate fills a need not met by existing systems, and has already been used to annotate two corpora, one of which involved over 250 hours of annotation effort.

* To appear at ACL as a demo

Via

Access Paper or Ask Questions

Look Who's Talking: Inferring Speaker Attributes from Personal Longitudinal Dialog

Apr 25, 2019
Charles Welch, Verónica Pérez-Rosas, Jonathan K. Kummerfeld, Rada Mihalcea

Figure 1 for Look Who's Talking: Inferring Speaker Attributes from Personal Longitudinal Dialog

Figure 2 for Look Who's Talking: Inferring Speaker Attributes from Personal Longitudinal Dialog

Figure 3 for Look Who's Talking: Inferring Speaker Attributes from Personal Longitudinal Dialog

Figure 4 for Look Who's Talking: Inferring Speaker Attributes from Personal Longitudinal Dialog

We examine a large dialog corpus obtained from the conversation history of a single individual with 104 conversation partners. The corpus consists of half a million instant messages, across several messaging platforms. We focus our analyses on seven speaker attributes, each of which partitions the set of speakers, namely: gender; relative age; family member; romantic partner; classmate; co-worker; and native to the same country. In addition to the content of the messages, we examine conversational aspects such as the time messages are sent, messaging frequency, psycholinguistic word categories, linguistic mirroring, and graph-based features reflecting how people in the corpus mention each other. We present two sets of experiments predicting each attribute using (1) short context windows; and (2) a larger set of messages. We find that using all features leads to gains of 9-14% over using message text only.

* Proceedings of the 20th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2019)
* 15 pages accepted to CICLing 2019

Via

Access Paper or Ask Questions