Picture for Paul Mineiro

Paul Mineiro

Active, anytime-valid risk controlling prediction sets

Add code
Jun 15, 2024
Viaarxiv icon

Online Joint Fine-tuning of Multi-Agent Flows

Add code
Jun 06, 2024
Viaarxiv icon

Provably Efficient Interactive-Grounded Learning with Personalized Reward

Add code
May 31, 2024
Viaarxiv icon

Aligning LLM Agents by Learning Latent Preference from User Edits

Add code
Apr 23, 2024
Figure 1 for Aligning LLM Agents by Learning Latent Preference from User Edits
Figure 2 for Aligning LLM Agents by Learning Latent Preference from User Edits
Figure 3 for Aligning LLM Agents by Learning Latent Preference from User Edits
Figure 4 for Aligning LLM Agents by Learning Latent Preference from User Edits
Viaarxiv icon

Efficient Contextual Bandits with Uninformed Feedback Graphs

Add code
Feb 12, 2024
Figure 1 for Efficient Contextual Bandits with Uninformed Feedback Graphs
Figure 2 for Efficient Contextual Bandits with Uninformed Feedback Graphs
Viaarxiv icon

Time-uniform confidence bands for the CDF under nonstationarity

Add code
Feb 28, 2023
Figure 1 for Time-uniform confidence bands for the CDF under nonstationarity
Figure 2 for Time-uniform confidence bands for the CDF under nonstationarity
Figure 3 for Time-uniform confidence bands for the CDF under nonstationarity
Figure 4 for Time-uniform confidence bands for the CDF under nonstationarity
Viaarxiv icon

Graph Feedback via Reduction to Regression

Add code
Feb 17, 2023
Figure 1 for Graph Feedback via Reduction to Regression
Figure 2 for Graph Feedback via Reduction to Regression
Figure 3 for Graph Feedback via Reduction to Regression
Viaarxiv icon

Infinite Action Contextual Bandits with Reusable Data Exhaust

Add code
Feb 16, 2023
Figure 1 for Infinite Action Contextual Bandits with Reusable Data Exhaust
Figure 2 for Infinite Action Contextual Bandits with Reusable Data Exhaust
Figure 3 for Infinite Action Contextual Bandits with Reusable Data Exhaust
Viaarxiv icon

Personalized Reward Learning with Interaction-Grounded Learning (IGL)

Add code
Nov 28, 2022
Figure 1 for Personalized Reward Learning with Interaction-Grounded Learning (IGL)
Figure 2 for Personalized Reward Learning with Interaction-Grounded Learning (IGL)
Figure 3 for Personalized Reward Learning with Interaction-Grounded Learning (IGL)
Figure 4 for Personalized Reward Learning with Interaction-Grounded Learning (IGL)
Viaarxiv icon

Towards Data-Driven Offline Simulations for Online Reinforcement Learning

Add code
Nov 14, 2022
Figure 1 for Towards Data-Driven Offline Simulations for Online Reinforcement Learning
Figure 2 for Towards Data-Driven Offline Simulations for Online Reinforcement Learning
Figure 3 for Towards Data-Driven Offline Simulations for Online Reinforcement Learning
Figure 4 for Towards Data-Driven Offline Simulations for Online Reinforcement Learning
Viaarxiv icon