Picture for Dheeraj Narasimha

Dheeraj Narasimha

PITA: Preference-Guided Inference-Time Alignment for LLM Post-Training

Add code
Jul 26, 2025
Viaarxiv icon

Model Predictive Control is Almost Optimal for Restless Bandit

Add code
Oct 08, 2024
Viaarxiv icon

CONGO: Compressive Online Gradient Optimization with Application to Microservices Management

Add code
Jul 08, 2024
Viaarxiv icon