Alert button
Picture for Alec Koppel

Alec Koppel

Alert button

Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies

Add code
Bookmark button
Alert button
Jun 12, 2022
Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Pratap Tokekar, Dinesh Manocha

Figure 1 for Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies
Figure 2 for Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies
Figure 3 for Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies
Figure 4 for Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies
Viaarxiv icon

Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 02, 2022
Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Brian M. Sadler, Furong Huang, Pratap Tokekar, Dinesh Manocha

Figure 1 for Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning
Figure 2 for Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning
Figure 3 for Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning
Figure 4 for Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning
Viaarxiv icon

Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation

Add code
Bookmark button
Alert button
Mar 02, 2022
Yulun Tian, Amrit Singh Bedi, Alec Koppel, Miguel Calvo-Fullana, David M. Rosen, Jonathan P. How

Figure 1 for Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation
Figure 2 for Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation
Figure 3 for Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation
Figure 4 for Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation
Viaarxiv icon

On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces

Add code
Bookmark button
Alert button
Jan 31, 2022
Amrit Singh Bedi, Souradip Chakraborty, Anjaly Parayil, Brian Sadler, Pratap Tokekar, Alec Koppel

Figure 1 for On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
Figure 2 for On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
Figure 3 for On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
Figure 4 for On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
Viaarxiv icon

Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search

Add code
Bookmark button
Alert button
Jan 21, 2022
Wesley A. Suttle, Alec Koppel, Ji Liu

Figure 1 for Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search
Figure 2 for Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search
Figure 3 for Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search
Figure 4 for Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search
Viaarxiv icon

Online, Informative MCMC Thinning with Kernelized Stein Discrepancy

Add code
Bookmark button
Alert button
Jan 18, 2022
Cole Hawkins, Alec Koppel, Zheng Zhang

Figure 1 for Online, Informative MCMC Thinning with Kernelized Stein Discrepancy
Figure 2 for Online, Informative MCMC Thinning with Kernelized Stein Discrepancy
Figure 3 for Online, Informative MCMC Thinning with Kernelized Stein Discrepancy
Figure 4 for Online, Informative MCMC Thinning with Kernelized Stein Discrepancy
Viaarxiv icon

Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming

Add code
Bookmark button
Alert button
Oct 22, 2021
Alec Koppel, Amrit Singh Bedi, Bhargav Ganguly, Vaneet Aggarwal

Figure 1 for Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming
Figure 2 for Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming
Figure 3 for Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming
Viaarxiv icon

Distributed Gaussian Process Mapping for Robot Teams with Time-varying Communication

Add code
Bookmark button
Alert button
Oct 12, 2021
James Di, Ehsan Zobeidi, Alec Koppel, Nikolay Atanasov

Figure 1 for Distributed Gaussian Process Mapping for Robot Teams with Time-varying Communication
Figure 2 for Distributed Gaussian Process Mapping for Robot Teams with Time-varying Communication
Figure 3 for Distributed Gaussian Process Mapping for Robot Teams with Time-varying Communication
Figure 4 for Distributed Gaussian Process Mapping for Robot Teams with Time-varying Communication
Viaarxiv icon

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach

Add code
Bookmark button
Alert button
Sep 13, 2021
Qinbo Bai, Amrit Singh Bedi, Mridul Agarwal, Alec Koppel, Vaneet Aggarwal

Figure 1 for Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Figure 2 for Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Figure 3 for Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Viaarxiv icon

Wasserstein-Splitting Gaussian Process Regression for Heterogeneous Online Bayesian Inference

Add code
Bookmark button
Alert button
Jul 26, 2021
Michael E. Kepler, Alec Koppel, Amrit Singh Bedi, Daniel J. Stilwell

Figure 1 for Wasserstein-Splitting Gaussian Process Regression for Heterogeneous Online Bayesian Inference
Figure 2 for Wasserstein-Splitting Gaussian Process Regression for Heterogeneous Online Bayesian Inference
Figure 3 for Wasserstein-Splitting Gaussian Process Regression for Heterogeneous Online Bayesian Inference
Figure 4 for Wasserstein-Splitting Gaussian Process Regression for Heterogeneous Online Bayesian Inference
Viaarxiv icon