Picture for Dheeraj Narasimha

Dheeraj Narasimha

PITA: Preference-Guided Inference-Time Alignment for LLM Post-Training

Add code
Jul 26, 2025
Viaarxiv icon

Model Predictive Control is Almost Optimal for Restless Bandit

Add code
Oct 08, 2024
Figure 1 for Model Predictive Control is Almost Optimal for Restless Bandit
Figure 2 for Model Predictive Control is Almost Optimal for Restless Bandit
Figure 3 for Model Predictive Control is Almost Optimal for Restless Bandit
Figure 4 for Model Predictive Control is Almost Optimal for Restless Bandit
Viaarxiv icon

CONGO: Compressive Online Gradient Optimization with Application to Microservices Management

Add code
Jul 08, 2024
Viaarxiv icon