Picture for Sattar Vakili

Sattar Vakili

Many Needles in a Haystack: Active Hit Discovery for Perturbation Experiments

Add code
May 11, 2026
Viaarxiv icon

A Finite Time Analysis of Thompson Sampling for Bayesian Optimization with Preferential Feedback

Add code
Apr 27, 2026
Viaarxiv icon

MixFlow: Mixture-Conditioned Flow Matching for Out-of-Distribution Generalization

Add code
Jan 16, 2026
Viaarxiv icon

Reinforcement Learning Using known Invariances

Add code
Nov 05, 2025
Viaarxiv icon

No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes

Add code
Oct 23, 2025
Viaarxiv icon

Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds

Add code
May 29, 2025
Figure 1 for Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds
Figure 2 for Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds
Figure 3 for Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds
Figure 4 for Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds
Viaarxiv icon

Towards a Foundation Model for Communication Systems

Add code
May 20, 2025
Figure 1 for Towards a Foundation Model for Communication Systems
Figure 2 for Towards a Foundation Model for Communication Systems
Figure 3 for Towards a Foundation Model for Communication Systems
Figure 4 for Towards a Foundation Model for Communication Systems
Viaarxiv icon

Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity

Add code
May 16, 2025
Viaarxiv icon

Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning

Add code
Feb 11, 2025
Figure 1 for Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning
Figure 2 for Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning
Figure 3 for Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning
Figure 4 for Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning
Viaarxiv icon

Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm

Add code
Oct 30, 2024
Figure 1 for Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm
Viaarxiv icon