Alert button
Picture for Ambuj Tewari

Ambuj Tewari

Alert button

Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks

Mar 06, 2024
Ziping Xu, Zifan Xu, Runxuan Jiang, Peter Stone, Ambuj Tewari

Figure 1 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Figure 2 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Figure 3 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Figure 4 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Viaarxiv icon

Optimal Thresholding Linear Bandit

Feb 11, 2024
Eduardo Ochoa Rivera, Ambuj Tewari

Viaarxiv icon

The Complexity of Sequential Prediction in Dynamical Systems

Feb 09, 2024
Vinod Raman, Unique Subedi, Ambuj Tewari

Viaarxiv icon

A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Low-Rank MDPs

Feb 07, 2024
Kihyuk Hong, Ambuj Tewari

Viaarxiv icon

A Framework for Partially Observed Reward-States in RLHF

Feb 05, 2024
Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari

Viaarxiv icon

Revisiting the Learnability of Apple Tasting

Oct 29, 2023
Vinod Raman, Unique Subedi, Ananth Raman, Ambuj Tewari

Viaarxiv icon

Sequence Length Independent Norm-Based Generalization Bounds for Transformers

Oct 19, 2023
Jacob Trauger, Ambuj Tewari

Viaarxiv icon

Conformal Contextual Robust Optimization

Oct 16, 2023
Yash Patel, Sahana Rayan, Ambuj Tewari

Viaarxiv icon

On the Computational Complexity of Private High-dimensional Model Selection via the Exponential Mechanism

Oct 11, 2023
Saptarshi Roy, Ambuj Tewari

Viaarxiv icon

Online Infinite-Dimensional Regression: Learning Linear Operators

Sep 21, 2023
Vinod Raman, Unique Subedi, Ambuj Tewari

Viaarxiv icon