Picture for Aman Chadha

Aman Chadha

SleepWalk: A Three-Tier Benchmark for Stress-Testing Instruction-Guided Vision-Language Navigation

Add code
May 11, 2026
Viaarxiv icon

Prediction Bottlenecks Don't Discover Causal Structure (But Here's What They Actually Do)

Add code
May 09, 2026
Viaarxiv icon

Moral Sensitivity in LLMs: A Tiered Evaluation of Contextual Bias via Behavioral Profiling and Mechanistic Interpretability

Add code
May 04, 2026
Viaarxiv icon

Personality Shapes Gender Bias in Persona-Conditioned LLM Narratives Across English and Hindi: An Empirical Investigation

Add code
Apr 26, 2026
Viaarxiv icon

Don't Make the LLM Read the Graph: Make the Graph Think

Add code
Apr 24, 2026
Viaarxiv icon

CONSCIENTIA: Can LLM Agents Learn to Strategize? Emergent Deception and Trust in a Multi-Agent NYC Simulation

Add code
Apr 10, 2026
Viaarxiv icon

Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework

Add code
Apr 07, 2026
Viaarxiv icon

Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models

Add code
Mar 23, 2026
Viaarxiv icon

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

Add code
Mar 10, 2026
Viaarxiv icon

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning

Add code
Mar 03, 2026
Viaarxiv icon