Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

"Topic": models, code, and papers

Searching for PETs: Using Distributional and Sentiment-Based Methods to Find Potentially Euphemistic Terms

May 20, 2022
Patrick Lee, Martha Gavidia, Anna Feldman, Jing Peng

This paper presents a linguistically driven proof of concept for finding potentially euphemistic terms, or PETs. Acknowledging that PETs tend to be commonly used expressions for a certain range of sensitive topics, we make use of distributional similarities to select and filter phrase candidates from a sentence and rank them using a set of simple sentiment-based metrics. We present the results of our approach tested on a corpus of sentences containing euphemisms, demonstrating its efficacy for detecting single and multi-word PETs from a broad range of topics. We also discuss future potential for sentiment-based methods on this task.

* Proceedings of UnImplicit: The Second Workshop on Understanding Implicit and Underspecified Language, NAACL 2022, Seattle 

  Access Paper or Ask Questions

Exploration of Gender Differences in COVID-19 Discourse on Reddit

Aug 13, 2020
Jai Aggarwal, Ella Rabinovich, Suzanne Stevenson

Decades of research on differences in the language of men and women have established postulates about preferences in lexical, topical, and emotional expression between the two genders, along with their sociological underpinnings. Using a novel dataset of male and female linguistic productions collected from the Reddit discussion platform, we further confirm existing assumptions about gender-linked affective distinctions, and demonstrate that these distinctions are amplified in social media postings involving emotionally-charged discourse related to COVID-19. Our analysis also confirms considerable differences in topical preferences between male and female authors in spontaneous pandemic-related discussions.

* Proceedings of the 1st Workshop on NLP for COVID-19 (ACL 2020) 

  Access Paper or Ask Questions

A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions

Sep 09, 2021
Xiaocong Chen, Lina Yao, Julian McAuley, Guanglin Zhou, Xianzhi Wang

In light of the emergence of deep reinforcement learning (DRL) in recommender systems research and several fruitful results in recent years, this survey aims to provide a timely and comprehensive overview of the recent trends of deep reinforcement learning in recommender systems. We start with the motivation of applying DRL in recommender systems. Then, we provide a taxonomy of current DRL-based recommender systems and a summary of existing methods. We discuss emerging topics and open issues, and provide our perspective on advancing the domain. This survey serves as introductory material for readers from academia and industry into the topic and identifies notable opportunities for further research.

  Access Paper or Ask Questions

Neural Temporal Opinion Modelling for Opinion Prediction on Twitter

May 27, 2020
Lixing Zhu, Yulan He, Deyu Zhou

Opinion prediction on Twitter is challenging due to the transient nature of tweet content and neighbourhood context. In this paper, we model users' tweet posting behaviour as a temporal point process to jointly predict the posting time and the stance label of the next tweet given a user's historical tweet sequence and tweets posted by their neighbours. We design a topic-driven attention mechanism to capture the dynamic topic shifts in the neighbourhood context. Experimental results show that the proposed model predicts both the posting time and the stance labels of future tweets more accurately compared to a number of competitive baselines.

* To appear at ACL2020 

  Access Paper or Ask Questions

Dirichlet-vMF Mixture Model

Feb 24, 2017
Shaohua Li

This document is about the multi-document Von-Mises-Fisher mixture model with a Dirichlet prior, referred to as VMFMix. VMFMix is analogous to Latent Dirichlet Allocation (LDA) in that they can capture the co-occurrence patterns acorss multiple documents. The difference is that in VMFMix, the topic-word distribution is defined on a continuous n-dimensional hypersphere. Hence VMFMix is used to derive topic embeddings, i.e., representative vectors, from multiple sets of embedding vectors. An efficient Variational Expectation-Maximization inference algorithm is derived. The performance of VMFMix on two document classification tasks is reported, with some preliminary analysis.

* 5 pages 

  Access Paper or Ask Questions

Learning from LDA using Deep Neural Networks

Aug 05, 2015
Dongxu Zhang, Tianyi Luo, Dong Wang, Rong Liu

Latent Dirichlet Allocation (LDA) is a three-level hierarchical Bayesian model for topic inference. In spite of its great success, inferring the latent topic distribution with LDA is time-consuming. Motivated by the transfer learning approach proposed by~\newcite{hinton2015distilling}, we present a novel method that uses LDA to supervise the training of a deep neural network (DNN), so that the DNN can approximate the costly LDA inference with less computation. Our experiments on a document classification task show that a simple DNN can learn the LDA behavior pretty well, while the inference is speeded up tens or hundreds of times.

  Access Paper or Ask Questions

"How Was Your Weekend?" A Generative Model of Phatic Conversation

Feb 13, 2018
Hannah Morrison, Chris Martens

Unspoken social rules, such as those that govern choosing a proper discussion topic and when to change discussion topics, guide conversational behaviors. We propose a computational model of conversation that can follow or break such rules, with participant agents that respond accordingly. Additionally, we demonstrate an application of the model: the Experimental Social Tutor (EST), a first step toward a social skills training tool that generates human-readable conversation and a conversational guideline at each point in the dialogue. Finally, we discuss the design and results of a pilot study evaluating the EST. Results show that our model is capable of producing conversations that follow social norms.

* Accepted submission at FLAIRS-31 

  Access Paper or Ask Questions

A Rule-Based Short Query Intent Identification System

Mar 25, 2015
Arijit De, Sunil Kumar Kopparapu

Using SMS (Short Message System), cell phones can be used to query for information about various topics. In an SMS based search system, one of the key problems is to identify a domain (broad topic) associated with the user query; so that a more comprehensive search can be carried out by the domain specific search engine. In this paper we use a rule based approach, to identify the domain, called Short Query Intent Identification System (SQIIS). We construct two different rule-bases using different strategies to suit query intent identification. We evaluate the two rule-bases experimentally.

* 5 pages, 2010 International Conference on Signal and Image Processing (ICSIP) 

  Access Paper or Ask Questions

Capturing Knowledge of User Preferences: ontologies on recommender systems

Mar 08, 2002
S. E. Middleton, D. C. De Roure, N. R. Shadbolt

Tools for filtering the World Wide Web exist, but they are hampered by the difficulty of capturing user preferences in such a dynamic environment. We explore the acquisition of user profiles by unobtrusive monitoring of browsing behaviour and application of supervised machine-learning techniques coupled with an ontological representation to extract user preferences. A multi-class approach to paper classification is used, allowing the paper topic taxonomy to be utilised during profile construction. The Quickstep recommender system is presented and two empirical studies evaluate it in a real work setting, measuring the effectiveness of using a hierarchical topic ontology compared with an extendable flat list.

* First international conference on Knowledge Capture 2001, 8 pages 

  Access Paper or Ask Questions

Marketing Analytics: Methods, Practice, Implementation, and Links to Other Fields

Jan 28, 2018
Stephen L. France, Sanjoy Ghose

Marketing analytics is a diverse field, with both academic researchers and practitioners coming from a range of backgrounds including marketing, operations research, statistics, and computer science. This paper provides an integrative review at the boundary of these three areas. The topics of visualization, segmentation, and class prediction are featured. Links between the disciplines are emphasized. For each of these topics, a historical overview is given, starting with initial work in the 1960s and carrying through to the present day. Recent innovations for modern large and complex "big data" sets are described. Practical implementation advice is given, along with a directory of open source R routines for implementing marketing analytics techniques.

  Access Paper or Ask Questions