Picture for Subramanyam Sahoo

Subramanyam Sahoo

From Knowledge to Action: Outcomes of the 2025 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

Add code
May 04, 2026
Viaarxiv icon

Calibration Collapse Under Sycophancy Fine-Tuning: How Reward Hacking Breaks Uncertainty Quantification in LLMs

Add code
Apr 12, 2026
Viaarxiv icon

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

Add code
Mar 10, 2026
Viaarxiv icon

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning

Add code
Mar 03, 2026
Viaarxiv icon

The Controllability Trap: A Governance Framework for Military AI Agents

Add code
Mar 03, 2026
Viaarxiv icon

I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift

Add code
Mar 01, 2026
Viaarxiv icon

When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation

Add code
Feb 18, 2026
Viaarxiv icon

The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds

Add code
Dec 25, 2025
Figure 1 for The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds
Figure 2 for The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds
Figure 3 for The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds
Figure 4 for The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds
Viaarxiv icon

The Double Life of Code World Models: Provably Unmasking Malicious Behavior Through Execution Traces

Add code
Dec 15, 2025
Viaarxiv icon

The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training

Add code
Nov 17, 2025
Figure 1 for The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training
Figure 2 for The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training
Figure 3 for The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training
Figure 4 for The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training
Viaarxiv icon