Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation

Mar 09, 2021

Nikolay Babakov, Varvara Logacheva, Olga Kozlova, Nikita Semenov, Alexander Panchenko

Figure 1 for Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation

Figure 2 for Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation

Figure 3 for Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation

Figure 4 for Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation

Share this with someone who'll enjoy it:

Abstract:Not all topics are equally "flammable" in terms of toxicity: a calm discussion of turtles or fishing less often fuels inappropriate toxic dialogues than a discussion of politics or sexual minorities. We define a set of sensitive topics that can yield inappropriate and toxic messages and describe the methodology of collecting and labeling a dataset for appropriateness. While toxicity in user-generated data is well-studied, we aim at defining a more fine-grained notion of inappropriateness. The core of inappropriateness is that it can harm the reputation of a speaker. This is different from toxicity in two respects: (i) inappropriateness is topic-related, and (ii) inappropriate message is not toxic but still unacceptable. We collect and release two datasets for Russian: a topic-labeled dataset and an appropriateness-labeled dataset. We also release pre-trained classification models trained on this data.

* Accepted to the Balto-Slavic NLP workshop 2021 co-located with EACL-2021

View paper on

Share this with someone who'll enjoy it:

Title:Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation

Paper and Code