Vinija Jain

Findings of the Counter Turing Test: AI-Generated Image Detection

May 21, 2026
Via arXiv

SleepWalk: A Three-Tier Benchmark for Stress-Testing Instruction-Guided Vision-Language Navigation

May 11, 2026
Via arXiv

Moral Sensitivity in LLMs: A Tiered Evaluation of Contextual Bias via Behavioral Profiling and Mechanistic Interpretability

May 04, 2026
Via arXiv

Personality Shapes Gender Bias in Persona-Conditioned LLM Narratives Across English and Hindi: An Empirical Investigation

Apr 26, 2026
Via arXiv

CONSCIENTIA: Can LLM Agents Learn to Strategize? Emergent Deception and Trust in a Multi-Agent NYC Simulation

Apr 10, 2026
Via arXiv

Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models

Mar 23, 2026
Via arXiv

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

Mar 10, 2026
Via arXiv

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning

Mar 03, 2026
Via arXiv

I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift

Mar 01, 2026
Via arXiv

Neural FOXP2 -- Language Specific Neuron Steering for Targeted Language Improvement in LLMs

Feb 01, 2026
Via arXiv