Picture for Dimitri Bertsekas

Dimitri Bertsekas

Superior Computer Chess with Model Predictive Control, Reinforcement Learning, and Rollout

Add code
Sep 10, 2024
Viaarxiv icon

An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking

Add code
May 24, 2024
Viaarxiv icon

Most Likely Sequence Generation for $n$-Grams, Transformers, HMMs, and Markov Chains, by Using Rollout Algorithms

Add code
Mar 19, 2024
Viaarxiv icon

Approximate Multiagent Reinforcement Learning for On-Demand Urban Mobility Problem on a Large Map (extended version)

Add code
Nov 02, 2023
Viaarxiv icon

Rollout Algorithms and Approximate Dynamic Programming for Bayesian Optimization and Sequential Estimation

Add code
Dec 29, 2022
Viaarxiv icon

Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach

Add code
Nov 29, 2022
Viaarxiv icon

Multiagent Reinforcement Learning for Autonomous Routing and Pickup Problem with Adaptation to Variable Demand

Add code
Nov 28, 2022
Viaarxiv icon

New Auction Algorithms for Path Planning, Network Transport, and Reinforcement Learning

Add code
Jul 19, 2022
Figure 1 for New Auction Algorithms for Path Planning, Network Transport, and Reinforcement Learning
Figure 2 for New Auction Algorithms for Path Planning, Network Transport, and Reinforcement Learning
Figure 3 for New Auction Algorithms for Path Planning, Network Transport, and Reinforcement Learning
Figure 4 for New Auction Algorithms for Path Planning, Network Transport, and Reinforcement Learning
Viaarxiv icon

Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control

Add code
Aug 20, 2021
Figure 1 for Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control
Figure 2 for Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control
Figure 3 for Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control
Figure 4 for Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control
Viaarxiv icon

On-Line Policy Iteration for Infinite Horizon Dynamic Programming

Add code
Jun 01, 2021
Viaarxiv icon