Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lucas Dixon

KNNs of Semantic Encodings for Rating Prediction

Feb 01, 2023

Léo Laugier, Thomas Bonald, Lucas Dixon, Raghuram Vadapalli

Abstract:This paper explores a novel application of textual semantic similarity to user-preference representation for rating prediction. The approach represents a user's preferences as a graph of textual snippets from review text, where the edges are defined by semantic similarity. This textual, memory-based approach to rating prediction enables review-based explanations for recommendations. The method is evaluated quantitatively, highlighting that leveraging text in this way outperforms both strong memory-based and model-based collaborative filtering baselines.

Via

Access Paper or Ask Questions

Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis

Jun 17, 2022

Shayegan Omidshafiei, Andrei Kapishnikov, Yannick Assogba, Lucas Dixon, Been Kim

Figure 1 for Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis

Figure 2 for Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis

Figure 3 for Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis

Figure 4 for Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis

Abstract:Each year, expert-level performance is attained in increasingly-complex multiagent domains, notable examples including Go, Poker, and StarCraft II. This rapid progression is accompanied by a commensurate need to better understand how such agents attain this performance, to enable their safe deployment, identify limitations, and reveal potential means of improving them. In this paper we take a step back from performance-focused multiagent learning, and instead turn our attention towards agent behavior analysis. We introduce a model-agnostic method for discovery of behavior clusters in multiagent domains, using variational inference to learn a hierarchy of behaviors at the joint and local agent levels. Our framework makes no assumption about agents' underlying learning algorithms, does not require access to their latent states or models, and can be trained using entirely offline observational data. We illustrate the effectiveness of our method for enabling the coupled understanding of behaviors at the joint and local agent level, detection of behavior changepoints throughout training, discovery of core behavioral concepts (e.g., those that facilitate higher returns), and demonstrate the approach's scalability to a high-dimensional multiagent MuJoCo control domain.

Via

Access Paper or Ask Questions

On Natural Language User Profiles for Transparent and Scrutable Recommendation

May 19, 2022

Filip Radlinski, Krisztian Balog, Fernando Diaz, Lucas Dixon, Ben Wedin

Figure 1 for On Natural Language User Profiles for Transparent and Scrutable Recommendation

Figure 2 for On Natural Language User Profiles for Transparent and Scrutable Recommendation

Figure 3 for On Natural Language User Profiles for Transparent and Scrutable Recommendation

Figure 4 for On Natural Language User Profiles for Transparent and Scrutable Recommendation

Abstract:Natural interaction with recommendation and personalized search systems has received tremendous attention in recent years. We focus on the challenge of supporting people's understanding and control of these systems and explore a fundamentally new way of thinking about representation of knowledge in recommendation and personalization systems. Specifically, we argue that it may be both desirable and possible for algorithms that use natural language representations of users' preferences to be developed. We make the case that this could provide significantly greater transparency, as well as affordances for practical actionable interrogation of, and control over, recommendations. Moreover, we argue that such an approach, if successfully applied, may enable a major step towards systems that rely less on noisy implicit observations while increasing portability of knowledge of one's interests.

* Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '22), 2022

Via

Access Paper or Ask Questions

GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

Dec 13, 2021

Nan Du, Yanping Huang, Andrew M. Dai, Simon Tong, Dmitry Lepikhin, Yuanzhong Xu, Maxim Krikun, Yanqi Zhou, Adams Wei Yu, Orhan Firat(+17 more)

Figure 1 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

Figure 2 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

Figure 3 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

Figure 4 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

Abstract:Scaling language models with more data, compute and parameters has driven significant progress in natural language processing. For example, thanks to scaling, GPT-3 was able to achieve strong results on in-context learning tasks. However, training these large dense models requires significant amounts of computing resources. In this paper, we propose and develop a family of language models named GLaM (Generalist Language Model), which uses a sparsely activated mixture-of-experts architecture to scale the model capacity while also incurring substantially less training cost compared to dense variants. The largest GLaM has 1.2 trillion parameters, which is approximately 7x larger than GPT-3. It consumes only 1/3 of the energy used to train GPT-3 and requires half of the computation flops for inference, while still achieving better overall zero-shot and one-shot performance across 29 NLP tasks.

Via

Access Paper or Ask Questions

Toxicity Detection can be Sensitive to the Conversational Context

Nov 19, 2021

Alexandros Xenos, John Pavlopoulos, Ion Androutsopoulos, Lucas Dixon, Jeffrey Sorensen, Leo Laugier

Figure 1 for Toxicity Detection can be Sensitive to the Conversational Context

Figure 2 for Toxicity Detection can be Sensitive to the Conversational Context

Figure 3 for Toxicity Detection can be Sensitive to the Conversational Context

Figure 4 for Toxicity Detection can be Sensitive to the Conversational Context

Abstract:User posts whose perceived toxicity depends on the conversational context are rare in current toxicity detection datasets. Hence, toxicity detectors trained on existing datasets will also tend to disregard context, making the detection of context-sensitive toxicity harder when it does occur. We construct and publicly release a dataset of 10,000 posts with two kinds of toxicity labels: (i) annotators considered each post with the previous one as context; and (ii) annotators had no additional context. Based on this, we introduce a new task, context sensitivity estimation, which aims to identify posts whose perceived toxicity changes if the context (previous post) is also considered. We then evaluate machine learning systems on this task, showing that classifiers of practical quality can be developed, and we show that data augmentation with knowledge distillation can improve the performance further. Such systems could be used to enhance toxicity detection datasets with more context-dependent posts, or to suggest when moderators should consider the parent posts, which often may be unnecessary and may otherwise introduce significant additional cost.

* 13 pages, 8 figures

Via

Access Paper or Ask Questions

Augmenting the User-Item Graph with Textual Similarity Models

Sep 20, 2021

Federico López, Martin Scholz, Jessica Yung, Marie Pellat, Michael Strube, Lucas Dixon

Figure 1 for Augmenting the User-Item Graph with Textual Similarity Models

Figure 2 for Augmenting the User-Item Graph with Textual Similarity Models

Figure 3 for Augmenting the User-Item Graph with Textual Similarity Models

Figure 4 for Augmenting the User-Item Graph with Textual Similarity Models

Abstract:This paper introduces a simple and effective form of data augmentation for recommender systems. A paraphrase similarity model is applied to widely available textual data, such as reviews and product descriptions, yielding new semantic relations that are added to the user-item graph. This increases the density of the graph without needing further labeled data. The data augmentation is evaluated on a variety of recommendation algorithms, using Euclidean, hyperbolic, and complex spaces, and over three categories of Amazon product reviews with differing characteristics. Results show that the data augmentation technique provides significant improvements to all types of models, with the most pronounced gains for knowledge graph-based recommenders, particularly in cold-start settings, leading to state-of-the-art performance.

* 12 pages, 2 figures

Via

Access Paper or Ask Questions

Civil Rephrases Of Toxic Texts With Self-Supervised Transformers

Feb 11, 2021

Leo Laugier, John Pavlopoulos, Jeffrey Sorensen, Lucas Dixon

Figure 1 for Civil Rephrases Of Toxic Texts With Self-Supervised Transformers

Figure 2 for Civil Rephrases Of Toxic Texts With Self-Supervised Transformers

Figure 3 for Civil Rephrases Of Toxic Texts With Self-Supervised Transformers

Figure 4 for Civil Rephrases Of Toxic Texts With Self-Supervised Transformers

Abstract:Platforms that support online commentary, from social networks to news sites, are increasingly leveraging machine learning to assist their moderation efforts. But this process does not typically provide feedback to the author that would help them contribute according to the community guidelines. This is prohibitively time-consuming for human moderators to do, and computational approaches are still nascent. This work focuses on models that can help suggest rephrasings of toxic comments in a more civil manner. Inspired by recent progress in unpaired sequence-to-sequence tasks, a self-supervised learning model is introduced, called CAE-T5. CAE-T5 employs a pre-trained text-to-text transformer, which is fine tuned with a denoising and cyclic auto-encoder loss. Experimenting with the largest toxicity detection dataset to date (Civil Comments) our model generates sentences that are more fluent and better at preserving the initial content compared to earlier text style transfer systems which we compare with using several scoring systems and human evaluation.

Via

Access Paper or Ask Questions

Six Attributes of Unhealthy Conversation

Oct 14, 2020

Ilan Price, Jordan Gifford-Moore, Jory Flemming, Saul Musker, Maayan Roichman, Guillaume Sylvain, Nithum Thain, Lucas Dixon, Jeffrey Sorensen

Figure 1 for Six Attributes of Unhealthy Conversation

Figure 2 for Six Attributes of Unhealthy Conversation

Figure 3 for Six Attributes of Unhealthy Conversation

Figure 4 for Six Attributes of Unhealthy Conversation

Abstract:We present a new dataset of approximately 44000 comments labeled by crowdworkers. Each comment is labelled as either 'healthy' or 'unhealthy', in addition to binary labels for the presence of six potentially 'unhealthy' sub-attributes: (1) hostile; (2) antagonistic, insulting, provocative or trolling; (3) dismissive; (4) condescending or patronising; (5) sarcastic; and/or (6) an unfair generalisation. Each label also has an associated confidence score. We argue that there is a need for datasets which enable research based on a broad notion of 'unhealthy online conversation'. We build this typology to encompass a substantial proportion of the individual comments which contribute to unhealthy online conversation. For some of these attributes, this is the first publicly available dataset of this scale. We explore the quality of the dataset, present some summary statistics and initial models to illustrate the utility of this data, and highlight limitations and directions for further research.

* Appearing in the 4th Workshop on Online Abuse and Harms (2020)

Via

Access Paper or Ask Questions

Toxicity Detection: Does Context Really Matter?

Jun 01, 2020

John Pavlopoulos, Jeffrey Sorensen, Lucas Dixon, Nithum Thain, Ion Androutsopoulos

Figure 1 for Toxicity Detection: Does Context Really Matter?

Figure 2 for Toxicity Detection: Does Context Really Matter?

Figure 3 for Toxicity Detection: Does Context Really Matter?

Figure 4 for Toxicity Detection: Does Context Really Matter?

Abstract:Moderation is crucial to promoting healthy on-line discussions. Although several `toxicity' detection datasets and models have been published, most of them ignore the context of the posts, implicitly assuming that comments maybe judged independently. We investigate this assumption by focusing on two questions: (a) does context affect the human judgement, and (b) does conditioning on context improve performance of toxicity detection systems? We experiment with Wikipedia conversations, limiting the notion of context to the previous post in the thread and the discussion title. We find that context can both amplify or mitigate the perceived toxicity of posts. Moreover, a small but significant subset of manually labeled posts (5% in one of our experiments) end up having the opposite toxicity labels if the annotators are not provided with context. Surprisingly, we also find no evidence that context actually improves the performance of toxicity classifiers, having tried a range of classifiers and mechanisms to make them context aware. This points to the need for larger datasets of comments annotated in context. We make our code and data publicly available.

Via

Access Paper or Ask Questions

Classifying Constructive Comments

Apr 14, 2020

Varada Kolhatkar, Nithum Thain, Jeffrey Sorensen, Lucas Dixon, Maite Taboada

Figure 1 for Classifying Constructive Comments

Figure 2 for Classifying Constructive Comments

Figure 3 for Classifying Constructive Comments

Figure 4 for Classifying Constructive Comments

Abstract:We introduce the Constructive Comments Corpus (C3), comprised of 12,000 annotated news comments, intended to help build new tools for online communities to improve the quality of their discussions. We define constructive comments as high-quality comments that make a contribution to the conversation. We explain the crowd worker annotation scheme and define a taxonomy of sub-characteristics of constructiveness. The quality of the annotation scheme and the resulting dataset is evaluated using measurements of inter-annotator agreement, expert assessment of a sample, and by the constructiveness sub-characteristics, which we show provide a proxy for the general constructiveness concept. We provide models for constructiveness trained on C3 using both feature-based and a variety of deep learning approaches and demonstrate that these models capture general rather than topic- or domain-specific characteristics of constructiveness, through domain adaptation experiments. We examine the role that length plays in our models, as comment length could be easily gamed if models depend heavily upon this feature. By examining the errors made by each model and their distribution by length, we show that the best performing models are less correlated with comment length.The constructiveness corpus and our experiments pave the way for a moderation tool focused on promoting comments that make a contribution, rather than only filtering out undesirable content.

* Withdrawn pending a new version

Via

Access Paper or Ask Questions