Alert button
Picture for Ambuj Tewari

Ambuj Tewari

Alert button

Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks

Add code
Bookmark button
Alert button
Mar 06, 2024
Ziping Xu, Zifan Xu, Runxuan Jiang, Peter Stone, Ambuj Tewari

Figure 1 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Figure 2 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Figure 3 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Figure 4 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Viaarxiv icon

Optimal Thresholding Linear Bandit

Add code
Bookmark button
Alert button
Feb 11, 2024
Eduardo Ochoa Rivera, Ambuj Tewari

Viaarxiv icon

The Complexity of Sequential Prediction in Dynamical Systems

Add code
Bookmark button
Alert button
Feb 09, 2024
Vinod Raman, Unique Subedi, Ambuj Tewari

Viaarxiv icon

A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Low-Rank MDPs

Add code
Bookmark button
Alert button
Feb 07, 2024
Kihyuk Hong, Ambuj Tewari

Viaarxiv icon

A Framework for Partially Observed Reward-States in RLHF

Add code
Bookmark button
Alert button
Feb 05, 2024
Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari

Viaarxiv icon

Revisiting the Learnability of Apple Tasting

Add code
Bookmark button
Alert button
Oct 29, 2023
Vinod Raman, Unique Subedi, Ananth Raman, Ambuj Tewari

Viaarxiv icon

Sequence Length Independent Norm-Based Generalization Bounds for Transformers

Add code
Bookmark button
Alert button
Oct 19, 2023
Jacob Trauger, Ambuj Tewari

Viaarxiv icon

Conformal Contextual Robust Optimization

Add code
Bookmark button
Alert button
Oct 16, 2023
Yash Patel, Sahana Rayan, Ambuj Tewari

Viaarxiv icon

On the Computational Complexity of Private High-dimensional Model Selection via the Exponential Mechanism

Add code
Bookmark button
Alert button
Oct 11, 2023
Saptarshi Roy, Ambuj Tewari

Viaarxiv icon

Online Infinite-Dimensional Regression: Learning Linear Operators

Add code
Bookmark button
Alert button
Sep 21, 2023
Vinod Raman, Unique Subedi, Ambuj Tewari

Viaarxiv icon