Picture for Mohammad Aflah Khan

Mohammad Aflah Khan

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

Add code
Jun 25, 2024
Figure 1 for Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
Figure 2 for Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
Figure 3 for Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
Figure 4 for Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
Viaarxiv icon

Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction

Add code
Apr 19, 2024
Viaarxiv icon

Probing Critical Learning Dynamics of PLMs for Hate Speech Detection

Add code
Feb 03, 2024
Viaarxiv icon

Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens Contributing to Explicit Hate in English by Span Detection

Add code
Nov 16, 2023
Viaarxiv icon

The Art of Embedding Fusion: Optimizing Hate Speech Detection

Add code
Jun 26, 2023
Figure 1 for The Art of Embedding Fusion: Optimizing Hate Speech Detection
Figure 2 for The Art of Embedding Fusion: Optimizing Hate Speech Detection
Figure 3 for The Art of Embedding Fusion: Optimizing Hate Speech Detection
Figure 4 for The Art of Embedding Fusion: Optimizing Hate Speech Detection
Viaarxiv icon

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Add code
Apr 03, 2023
Figure 1 for Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
Figure 2 for Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
Figure 3 for Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
Figure 4 for Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
Viaarxiv icon

Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization

Add code
Jun 08, 2022
Figure 1 for Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization
Figure 2 for Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization
Figure 3 for Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization
Figure 4 for Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization
Viaarxiv icon