Alert button
Picture for Aldo Pacchiano

Aldo Pacchiano

Alert button

Provable Interactive Learning with Hindsight Instruction Feedback

Add code
Bookmark button
Alert button
Apr 14, 2024
Dipendra Misra, Aldo Pacchiano, Robert E. Schapire

Viaarxiv icon

Multiple-policy Evaluation via Density Estimation

Add code
Bookmark button
Alert button
Mar 29, 2024
Yilei Chen, Aldo Pacchiano, Ioannis Ch. Paschalidis

Viaarxiv icon

Provably Sample Efficient RLHF via Active Preference Optimization

Add code
Bookmark button
Alert button
Feb 16, 2024
Nirjhar Das, Souradip Chakraborty, Aldo Pacchiano, Sayak Ray Chowdhury

Viaarxiv icon

A Framework for Partially Observed Reward-States in RLHF

Add code
Bookmark button
Alert button
Feb 05, 2024
Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari

Viaarxiv icon

Contextual Bandits with Stage-wise Constraints

Add code
Bookmark button
Alert button
Jan 15, 2024
Aldo Pacchiano, Mohammad Ghavamzadeh, Peter Bartlett

Viaarxiv icon

Experiment Planning with Function Approximation

Add code
Bookmark button
Alert button
Jan 10, 2024
Aldo Pacchiano, Jonathan N. Lee, Emma Brunskill

Viaarxiv icon

Unbiased Decisions Reduce Regret: Adversarial Domain Adaptation for the Bank Loan Problem

Add code
Bookmark button
Alert button
Aug 15, 2023
Elena Gal, Shaun Singh, Aldo Pacchiano, Ben Walker, Terry Lyons, Jakob Foerster

Figure 1 for Unbiased Decisions Reduce Regret: Adversarial Domain Adaptation for the Bank Loan Problem
Figure 2 for Unbiased Decisions Reduce Regret: Adversarial Domain Adaptation for the Bank Loan Problem
Figure 3 for Unbiased Decisions Reduce Regret: Adversarial Domain Adaptation for the Bank Loan Problem
Figure 4 for Unbiased Decisions Reduce Regret: Adversarial Domain Adaptation for the Bank Loan Problem
Viaarxiv icon

Anytime Model Selection in Linear Bandits

Add code
Bookmark button
Alert button
Jul 24, 2023
Parnian Kassraie, Aldo Pacchiano, Nicolas Emmenegger, Andreas Krause

Figure 1 for Anytime Model Selection in Linear Bandits
Figure 2 for Anytime Model Selection in Linear Bandits
Figure 3 for Anytime Model Selection in Linear Bandits
Figure 4 for Anytime Model Selection in Linear Bandits
Viaarxiv icon

Supervised Pretraining Can Learn In-Context Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 26, 2023
Jonathan N. Lee, Annie Xie, Aldo Pacchiano, Yash Chandak, Chelsea Finn, Ofir Nachum, Emma Brunskill

Figure 1 for Supervised Pretraining Can Learn In-Context Reinforcement Learning
Figure 2 for Supervised Pretraining Can Learn In-Context Reinforcement Learning
Figure 3 for Supervised Pretraining Can Learn In-Context Reinforcement Learning
Figure 4 for Supervised Pretraining Can Learn In-Context Reinforcement Learning
Viaarxiv icon

A Unified Model and Dimension for Interactive Estimation

Add code
Bookmark button
Alert button
Jun 09, 2023
Nataly Brukhim, Miroslav Dudik, Aldo Pacchiano, Robert Schapire

Figure 1 for A Unified Model and Dimension for Interactive Estimation
Viaarxiv icon