Alert button
Picture for Dale Schuurmans

Dale Schuurmans

Alert button

Learning Universal Policies via Text-Guided Video Generation

Add code
Bookmark button
Alert button
Feb 02, 2023
Yilun Du, Mengjiao Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Joshua B. Tenenbaum, Dale Schuurmans, Pieter Abbeel

Figure 1 for Learning Universal Policies via Text-Guided Video Generation
Figure 2 for Learning Universal Policies via Text-Guided Video Generation
Figure 3 for Learning Universal Policies via Text-Guided Video Generation
Figure 4 for Learning Universal Policies via Text-Guided Video Generation
Viaarxiv icon

The Role of Baselines in Policy Gradient Optimization

Add code
Bookmark button
Alert button
Jan 16, 2023
Jincheng Mei, Wesley Chung, Valentin Thomas, Bo Dai, Csaba Szepesvari, Dale Schuurmans

Figure 1 for The Role of Baselines in Policy Gradient Optimization
Figure 2 for The Role of Baselines in Policy Gradient Optimization
Figure 3 for The Role of Baselines in Policy Gradient Optimization
Viaarxiv icon

Memory Augmented Large Language Models are Computationally Universal

Add code
Bookmark button
Alert button
Jan 10, 2023
Dale Schuurmans

Figure 1 for Memory Augmented Large Language Models are Computationally Universal
Viaarxiv icon

Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off

Add code
Bookmark button
Alert button
Dec 17, 2022
Zichen Zhang, Johannes Kirschner, Junxi Zhang, Francesco Zanini, Alex Ayoub, Masood Dehghan, Dale Schuurmans

Figure 1 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 2 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 3 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 4 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Viaarxiv icon

Latent Variable Representation for Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 17, 2022
Tongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai

Figure 1 for Latent Variable Representation for Reinforcement Learning
Figure 2 for Latent Variable Representation for Reinforcement Learning
Figure 3 for Latent Variable Representation for Reinforcement Learning
Figure 4 for Latent Variable Representation for Reinforcement Learning
Viaarxiv icon

A Simple Decentralized Cross-Entropy Method

Add code
Bookmark button
Alert button
Dec 16, 2022
Zichen Zhang, Jun Jin, Martin Jagersand, Jun Luo, Dale Schuurmans

Figure 1 for A Simple Decentralized Cross-Entropy Method
Figure 2 for A Simple Decentralized Cross-Entropy Method
Figure 3 for A Simple Decentralized Cross-Entropy Method
Figure 4 for A Simple Decentralized Cross-Entropy Method
Viaarxiv icon

Score-based Continuous-time Discrete Diffusion Models

Add code
Bookmark button
Alert button
Nov 30, 2022
Haoran Sun, Lijun Yu, Bo Dai, Dale Schuurmans, Hanjun Dai

Figure 1 for Score-based Continuous-time Discrete Diffusion Models
Figure 2 for Score-based Continuous-time Discrete Diffusion Models
Figure 3 for Score-based Continuous-time Discrete Diffusion Models
Figure 4 for Score-based Continuous-time Discrete Diffusion Models
Viaarxiv icon

What learning algorithm is in-context learning? Investigations with linear models

Add code
Bookmark button
Alert button
Nov 29, 2022
Ekin Akyürek, Dale Schuurmans, Jacob Andreas, Tengyu Ma, Denny Zhou

Figure 1 for What learning algorithm is in-context learning? Investigations with linear models
Figure 2 for What learning algorithm is in-context learning? Investigations with linear models
Figure 3 for What learning algorithm is in-context learning? Investigations with linear models
Figure 4 for What learning algorithm is in-context learning? Investigations with linear models
Viaarxiv icon

TEMPERA: Test-Time Prompting via Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 21, 2022
Tianjun Zhang, Xuezhi Wang, Denny Zhou, Dale Schuurmans, Joseph E. Gonzalez

Figure 1 for TEMPERA: Test-Time Prompting via Reinforcement Learning
Figure 2 for TEMPERA: Test-Time Prompting via Reinforcement Learning
Figure 3 for TEMPERA: Test-Time Prompting via Reinforcement Learning
Figure 4 for TEMPERA: Test-Time Prompting via Reinforcement Learning
Viaarxiv icon

Learning to Optimize with Stochastic Dominance Constraints

Add code
Bookmark button
Alert button
Nov 21, 2022
Hanjun Dai, Yuan Xue, Niao He, Bethany Wang, Na Li, Dale Schuurmans, Bo Dai

Figure 1 for Learning to Optimize with Stochastic Dominance Constraints
Figure 2 for Learning to Optimize with Stochastic Dominance Constraints
Figure 3 for Learning to Optimize with Stochastic Dominance Constraints
Figure 4 for Learning to Optimize with Stochastic Dominance Constraints
Viaarxiv icon