Picture for Sarthak Roy

Sarthak Roy

Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management

Add code
Jun 27, 2024
Figure 1 for Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management
Figure 2 for Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management
Figure 3 for Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management
Figure 4 for Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management
Viaarxiv icon

Probing LLMs for hate speech detection: strengths and vulnerabilities

Add code
Oct 28, 2023
Figure 1 for Probing LLMs for hate speech detection: strengths and vulnerabilities
Figure 2 for Probing LLMs for hate speech detection: strengths and vulnerabilities
Figure 3 for Probing LLMs for hate speech detection: strengths and vulnerabilities
Figure 4 for Probing LLMs for hate speech detection: strengths and vulnerabilities
Viaarxiv icon