Deep Mehta

CanaryBench: Stress Testing Privacy Leakage in Cluster-Level Conversation Summaries

Jan 25, 2026
Via arXiv

Does Inference Scaling Improve Reasoning Faithfulness? A Multi-Model Analysis of Self-Consistency Tradeoffs

Jan 10, 2026
Via arXiv