Vaneet Aggarwal

Efficient $Q$-Learning and Actor-Critic Methods for Robust Average Reward Reinforcement Learning

Jun 08, 2025 (via arXiv)

Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach

May 26, 2025 (via arXiv)

Sample Complexity of Diffusion Model Training Without Empirical Risk Minimizer Access

May 23, 2025 (via arXiv)

Global Convergence for Average Reward Constrained MDPs with Primal-Dual Actor Critic Algorithm

May 21, 2025 (via arXiv)

Rack Position Optimization in Large-Scale Heterogeneous Data Centers

Mar 31, 2025 (via arXiv)

BalancedDPO: Adaptive Multi-Metric Alignment

Mar 16, 2025 (via arXiv)

Bi-Criteria Optimization for Combinatorial Bandits: Sublinear Regret and Constraint Violation under Bandit Feedback

Mar 15, 2025 (via arXiv)

Finite-Sample Analysis of Policy Evaluation for Robust Average Reward Reinforcement Learning

Feb 24, 2025 (via arXiv)

Order-Optimal Projection-Free Algorithm for Adversarially Constrained Online Convex Optimization

Feb 23, 2025 (via arXiv)

Every Call is Precious: Global Optimization of Black-Box Functions with Unknown Lipschitz Constants

Feb 06, 2025 (via arXiv)