Picture for Hari Sundaram

Hari Sundaram

CausalDetox: Causal Head Selection and Intervention for Language Model Detoxification

Add code
Apr 16, 2026
Viaarxiv icon

Masking or Mitigating? Deconstructing the Impact of Query Rewriting on Retriever Biases in RAG

Add code
Apr 07, 2026
Viaarxiv icon

From Plausible to Causal: Counterfactual Semantics for Policy Evaluation in Simulated Online Communities

Add code
Apr 05, 2026
Viaarxiv icon

AI Psychosis: Does Conversational AI Amplify Delusion-Related Language?

Add code
Mar 20, 2026
Viaarxiv icon

Social Simulacra in the Wild: AI Agent Communities on Moltbook

Add code
Mar 17, 2026
Viaarxiv icon

DIVEBATCH: Accelerating Model Training Through Gradient-Diversity Aware Batch Size Adaptation

Add code
Sep 19, 2025
Viaarxiv icon

Multi-modal Relational Item Representation Learning for Inferring Substitutable and Complementary Items

Add code
Jul 29, 2025
Figure 1 for Multi-modal Relational Item Representation Learning for Inferring Substitutable and Complementary Items
Figure 2 for Multi-modal Relational Item Representation Learning for Inferring Substitutable and Complementary Items
Figure 3 for Multi-modal Relational Item Representation Learning for Inferring Substitutable and Complementary Items
Figure 4 for Multi-modal Relational Item Representation Learning for Inferring Substitutable and Complementary Items
Viaarxiv icon

On the Necessity of Output Distribution Reweighting for Effective Class Unlearning

Add code
Jun 25, 2025
Viaarxiv icon

Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders

Add code
May 20, 2025
Viaarxiv icon

Algorithmic Collective Action with Two Collectives

Add code
Apr 30, 2025
Figure 1 for Algorithmic Collective Action with Two Collectives
Figure 2 for Algorithmic Collective Action with Two Collectives
Figure 3 for Algorithmic Collective Action with Two Collectives
Figure 4 for Algorithmic Collective Action with Two Collectives
Viaarxiv icon