Picture for Subramanyam Sahoo

Subramanyam Sahoo

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

Add code
Mar 10, 2026
Viaarxiv icon

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning

Add code
Mar 03, 2026
Viaarxiv icon

The Controllability Trap: A Governance Framework for Military AI Agents

Add code
Mar 03, 2026
Viaarxiv icon

I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift

Add code
Mar 01, 2026
Viaarxiv icon

When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation

Add code
Feb 18, 2026
Viaarxiv icon

The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds

Add code
Dec 25, 2025
Figure 1 for The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds
Figure 2 for The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds
Figure 3 for The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds
Figure 4 for The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds
Viaarxiv icon

The Double Life of Code World Models: Provably Unmasking Malicious Behavior Through Execution Traces

Add code
Dec 15, 2025
Viaarxiv icon

The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training

Add code
Nov 17, 2025
Figure 1 for The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training
Figure 2 for The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training
Figure 3 for The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training
Figure 4 for The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training
Viaarxiv icon

Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations

Add code
Nov 06, 2025
Figure 1 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 2 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 3 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 4 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Viaarxiv icon

Boardwalk Empire: How Generative AI is Revolutionizing Economic Paradigms

Add code
Oct 22, 2024
Viaarxiv icon