Hate Speech Detection


Hate-speech detection is the process of identifying and categorizing hate speech in text data.

xList-Hate: A Checklist-Based Framework for Interpretable and Generalizable Hate Speech Detection

Feb 05, 2026
Via arXiv

Trust The Typical

Feb 04, 2026
Via arXiv

LinGO: A Linguistic Graph Optimization Framework with LLMs for Interpreting Intents of Online Uncivil Discourse

Feb 04, 2026
Via arXiv

Persona Prompting as a Lens on LLM Social Reasoning

Jan 28, 2026
Via arXiv

SoftHateBench: Evaluating Moderation Models Against Reasoning-Driven, Policy-Compliant Hostility

Jan 28, 2026
Via arXiv

HateXScore: A Metric Suite for Evaluating Reasoning Quality in Hate Speech Explanations

Jan 20, 2026
Via arXiv

Bi-Attention HateXplain: Taking into account the sequential aspect of data during explainability in a multi-task context

Jan 19, 2026
Via arXiv

Improving Implicit Hate Speech Detection via a Community-Driven Multi-Agent Framework

Jan 14, 2026
Via arXiv

TANDEM: Temporal-Aware Neural Detection for Multimodal Hate Speech

Jan 16, 2026
Via arXiv

SyntaxMind at BLP-2025 Task 1: Leveraging Attention Fusion of CNN and GRU for Hate Speech Detection

Jan 09, 2026
Via arXiv