Picture for Dale Schuurmans

Dale Schuurmans

University of Alberta

Foundation Models for Decision Making: Problems, Methods, and Opportunities

Add code
Mar 07, 2023
Figure 1 for Foundation Models for Decision Making: Problems, Methods, and Opportunities
Figure 2 for Foundation Models for Decision Making: Problems, Methods, and Opportunities
Figure 3 for Foundation Models for Decision Making: Problems, Methods, and Opportunities
Figure 4 for Foundation Models for Decision Making: Problems, Methods, and Opportunities
Viaarxiv icon

Learning Universal Policies via Text-Guided Video Generation

Add code
Feb 02, 2023
Figure 1 for Learning Universal Policies via Text-Guided Video Generation
Figure 2 for Learning Universal Policies via Text-Guided Video Generation
Figure 3 for Learning Universal Policies via Text-Guided Video Generation
Figure 4 for Learning Universal Policies via Text-Guided Video Generation
Viaarxiv icon

The Role of Baselines in Policy Gradient Optimization

Add code
Jan 16, 2023
Viaarxiv icon

Memory Augmented Large Language Models are Computationally Universal

Add code
Jan 10, 2023
Viaarxiv icon

Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off

Add code
Dec 17, 2022
Figure 1 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 2 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 3 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 4 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Viaarxiv icon

Latent Variable Representation for Reinforcement Learning

Add code
Dec 17, 2022
Figure 1 for Latent Variable Representation for Reinforcement Learning
Figure 2 for Latent Variable Representation for Reinforcement Learning
Figure 3 for Latent Variable Representation for Reinforcement Learning
Figure 4 for Latent Variable Representation for Reinforcement Learning
Viaarxiv icon

A Simple Decentralized Cross-Entropy Method

Add code
Dec 16, 2022
Figure 1 for A Simple Decentralized Cross-Entropy Method
Figure 2 for A Simple Decentralized Cross-Entropy Method
Figure 3 for A Simple Decentralized Cross-Entropy Method
Figure 4 for A Simple Decentralized Cross-Entropy Method
Viaarxiv icon

Score-based Continuous-time Discrete Diffusion Models

Add code
Nov 30, 2022
Figure 1 for Score-based Continuous-time Discrete Diffusion Models
Figure 2 for Score-based Continuous-time Discrete Diffusion Models
Figure 3 for Score-based Continuous-time Discrete Diffusion Models
Figure 4 for Score-based Continuous-time Discrete Diffusion Models
Viaarxiv icon

What learning algorithm is in-context learning? Investigations with linear models

Add code
Nov 29, 2022
Figure 1 for What learning algorithm is in-context learning? Investigations with linear models
Figure 2 for What learning algorithm is in-context learning? Investigations with linear models
Figure 3 for What learning algorithm is in-context learning? Investigations with linear models
Figure 4 for What learning algorithm is in-context learning? Investigations with linear models
Viaarxiv icon

TEMPERA: Test-Time Prompting via Reinforcement Learning

Add code
Nov 21, 2022
Figure 1 for TEMPERA: Test-Time Prompting via Reinforcement Learning
Figure 2 for TEMPERA: Test-Time Prompting via Reinforcement Learning
Figure 3 for TEMPERA: Test-Time Prompting via Reinforcement Learning
Figure 4 for TEMPERA: Test-Time Prompting via Reinforcement Learning
Viaarxiv icon