Neemesh Yadav

DialToM: A Theory of Mind Benchmark for Forecasting State-Driven Dialogue Trajectories

Apr 22, 2026
Via arXiv

MHSafeEval: Role-Aware Interaction-Level Evaluation of Mental Health Safety in Large Language Models

Apr 20, 2026
Via arXiv

Effects of Theory of Mind and Prosocial Beliefs on Steering Human-Aligned Behaviors of LLMs in Ultimatum Games

May 30, 2025
Via arXiv

Revealing Hidden Mechanisms of Cross-Country Content Moderation with Natural Language Processing

Mar 07, 2025
Via arXiv

QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs

Dec 16, 2024
Via arXiv

Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech

Jun 06, 2024
Via arXiv

The Art of Embedding Fusion: Optimizing Hate Speech Detection

Jun 26, 2023
Via arXiv