Anuj Sharma

EST-PRM: Stress-Testing Process Reward Models Before They Become Load-Bearing

May 30, 2026
Via arXiv

Canonicalized Stable-List Replay for Private Federated Continual Learning over Language-Model Embeddings

May 29, 2026
Via arXiv

Topology-Aware State Abstraction with Tangle Cores for Markov Decision Processes

May 29, 2026
Via arXiv

Auditing Near-Optimal Policies Can Be Exponentially Hard: Conditional Query Lower Bounds via Occupancy Rashomon Capacity

May 29, 2026
Via arXiv

Dynamic Proxy-Mixing: Transferring Replay Controllers from Small to Large Models for Continual Instruction Tuning

May 29, 2026
Via arXiv

Grounded Decoding: Retrieval-Anchored Probability Fusion for Faithful RAG

May 29, 2026
Via arXiv

Continual Calibration: Coverage Can Collapse Before Accuracy in Lifelong LLM Fine-Tuning

Apr 27, 2026
Via arXiv

Coverage-Based Calibration for Post-Training Quantization via Weighted Set Cover over Outlier Channels

Apr 27, 2026
Via arXiv

Minimax Optimality and Spectral Routing for Majority-Vote Ensembles under Markov Dependence

Apr 15, 2026
Via arXiv

Beyond Accuracy: A Unified Random Matrix Theory Diagnostic Framework for Crash Classification Models

Feb 23, 2026
Via arXiv