Picture for Souradip Chakraborty

Souradip Chakraborty

Agentic Critical Training

Add code
Mar 09, 2026
Viaarxiv icon

Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training

Add code
Feb 26, 2026
Viaarxiv icon

Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away

Add code
Feb 11, 2026
Viaarxiv icon

Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts

Add code
Oct 06, 2025
Figure 1 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 2 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 3 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 4 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Viaarxiv icon

MIRA: Towards Mitigating Reward Hacking in Inference-Time Alignment of T2I Diffusion Models

Add code
Oct 02, 2025
Viaarxiv icon

Enhancing Diversity in Large Language Models via Determinantal Point Processes

Add code
Sep 05, 2025
Viaarxiv icon

Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models

Add code
Jun 04, 2025
Figure 1 for Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models
Figure 2 for Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models
Figure 3 for Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models
Figure 4 for Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models
Viaarxiv icon

Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time

Add code
May 29, 2025
Viaarxiv icon

Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection

Add code
Apr 02, 2025
Viaarxiv icon

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Add code
Mar 27, 2025
Viaarxiv icon