Picture for Jean Harb

Jean Harb

Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research

Add code
Oct 12, 2023
Figure 1 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Figure 2 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Figure 3 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Figure 4 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Viaarxiv icon

General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States

Add code
Jul 04, 2022
Figure 1 for General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States
Figure 2 for General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States
Figure 3 for General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States
Figure 4 for General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States
Viaarxiv icon

Policy Evaluation Networks

Add code
Feb 26, 2020
Figure 1 for Policy Evaluation Networks
Figure 2 for Policy Evaluation Networks
Figure 3 for Policy Evaluation Networks
Figure 4 for Policy Evaluation Networks
Viaarxiv icon

The Barbados 2018 List of Open Issues in Continual Learning

Add code
Nov 16, 2018
Viaarxiv icon

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Add code
Jan 16, 2018
Figure 1 for Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Figure 2 for Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Figure 3 for Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Figure 4 for Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Viaarxiv icon

Learnings Options End-to-End for Continuous Action Tasks

Add code
Nov 30, 2017
Figure 1 for Learnings Options End-to-End for Continuous Action Tasks
Viaarxiv icon

When Waiting is not an Option : Learning Options with a Deliberation Cost

Add code
Sep 14, 2017
Figure 1 for When Waiting is not an Option : Learning Options with a Deliberation Cost
Figure 2 for When Waiting is not an Option : Learning Options with a Deliberation Cost
Figure 3 for When Waiting is not an Option : Learning Options with a Deliberation Cost
Viaarxiv icon

Investigating Recurrence and Eligibility Traces in Deep Q-Networks

Add code
Apr 18, 2017
Figure 1 for Investigating Recurrence and Eligibility Traces in Deep Q-Networks
Figure 2 for Investigating Recurrence and Eligibility Traces in Deep Q-Networks
Figure 3 for Investigating Recurrence and Eligibility Traces in Deep Q-Networks
Figure 4 for Investigating Recurrence and Eligibility Traces in Deep Q-Networks
Viaarxiv icon

The Option-Critic Architecture

Add code
Dec 03, 2016
Figure 1 for The Option-Critic Architecture
Figure 2 for The Option-Critic Architecture
Figure 3 for The Option-Critic Architecture
Figure 4 for The Option-Critic Architecture
Viaarxiv icon