Picture for Mudit Gaur

Mudit Gaur

Sample Complexity of Diffusion Model Training Without Empirical Risk Minimizer Access

Add code
May 23, 2025
Viaarxiv icon

On The Global Convergence Of Online RLHF With Neural Parametrization

Add code
Oct 21, 2024
Viaarxiv icon

Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization

Add code
May 06, 2024
Viaarxiv icon

On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization

Add code
Jun 18, 2023
Viaarxiv icon

On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization

Add code
Nov 14, 2022
Viaarxiv icon