Alert button
Picture for Dale Schuurmans

Dale Schuurmans

Alert button

Reinforcement Teaching

Add code
Bookmark button
Alert button
Apr 25, 2022
Alex Lewandowski, Calarina Muslimani, Matthew E. Taylor, Jun Luo, Dale Schuurmans

Figure 1 for Reinforcement Teaching
Figure 2 for Reinforcement Teaching
Figure 3 for Reinforcement Teaching
Figure 4 for Reinforcement Teaching
Viaarxiv icon

Self-Consistency Improves Chain of Thought Reasoning in Language Models

Add code
Bookmark button
Alert button
Apr 06, 2022
Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Sharan Narang, Aakanksha Chowdhery, Denny Zhou

Figure 1 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Figure 2 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Figure 3 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Figure 4 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Viaarxiv icon

Chain of Thought Prompting Elicits Reasoning in Large Language Models

Add code
Bookmark button
Alert button
Jan 28, 2022
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Ed Chi, Quoc Le, Denny Zhou

Figure 1 for Chain of Thought Prompting Elicits Reasoning in Large Language Models
Figure 2 for Chain of Thought Prompting Elicits Reasoning in Large Language Models
Figure 3 for Chain of Thought Prompting Elicits Reasoning in Large Language Models
Figure 4 for Chain of Thought Prompting Elicits Reasoning in Large Language Models
Viaarxiv icon

Neural Stochastic Dual Dynamic Programming

Add code
Bookmark button
Alert button
Dec 01, 2021
Hanjun Dai, Yuan Xue, Zia Syed, Dale Schuurmans, Bo Dai

Figure 1 for Neural Stochastic Dual Dynamic Programming
Figure 2 for Neural Stochastic Dual Dynamic Programming
Figure 3 for Neural Stochastic Dual Dynamic Programming
Figure 4 for Neural Stochastic Dual Dynamic Programming
Viaarxiv icon

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs

Add code
Bookmark button
Alert button
Nov 01, 2021
Hongyu Ren, Hanjun Dai, Bo Dai, Xinyun Chen, Denny Zhou, Jure Leskovec, Dale Schuurmans

Figure 1 for SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs
Figure 2 for SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs
Figure 3 for SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs
Figure 4 for SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs
Viaarxiv icon

Understanding the Effect of Stochasticity in Policy Optimization

Add code
Bookmark button
Alert button
Oct 29, 2021
Jincheng Mei, Bo Dai, Chenjun Xiao, Csaba Szepesvari, Dale Schuurmans

Figure 1 for Understanding the Effect of Stochasticity in Policy Optimization
Figure 2 for Understanding the Effect of Stochasticity in Policy Optimization
Figure 3 for Understanding the Effect of Stochasticity in Policy Optimization
Viaarxiv icon

Combiner: Full Attention Transformer with Sparse Computation Cost

Add code
Bookmark button
Alert button
Jul 12, 2021
Hongyu Ren, Hanjun Dai, Zihang Dai, Mengjiao Yang, Jure Leskovec, Dale Schuurmans, Bo Dai

Figure 1 for Combiner: Full Attention Transformer with Sparse Computation Cost
Figure 2 for Combiner: Full Attention Transformer with Sparse Computation Cost
Figure 3 for Combiner: Full Attention Transformer with Sparse Computation Cost
Figure 4 for Combiner: Full Attention Transformer with Sparse Computation Cost
Viaarxiv icon

On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data

Add code
Bookmark button
Alert button
Jun 18, 2021
Chenjun Xiao, Ilbin Lee, Bo Dai, Dale Schuurmans, Csaba Szepesvari

Figure 1 for On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data
Figure 2 for On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data
Viaarxiv icon

Characterizing the Gap Between Actor-Critic and Policy Gradient

Add code
Bookmark button
Alert button
Jun 13, 2021
Junfeng Wen, Saurabh Kumar, Ramki Gummadi, Dale Schuurmans

Figure 1 for Characterizing the Gap Between Actor-Critic and Policy Gradient
Figure 2 for Characterizing the Gap Between Actor-Critic and Policy Gradient
Figure 3 for Characterizing the Gap Between Actor-Critic and Policy Gradient
Figure 4 for Characterizing the Gap Between Actor-Critic and Policy Gradient
Viaarxiv icon

Leveraging Non-uniformity in First-order Non-convex Optimization

Add code
Bookmark button
Alert button
May 13, 2021
Jincheng Mei, Yue Gao, Bo Dai, Csaba Szepesvari, Dale Schuurmans

Figure 1 for Leveraging Non-uniformity in First-order Non-convex Optimization
Figure 2 for Leveraging Non-uniformity in First-order Non-convex Optimization
Figure 3 for Leveraging Non-uniformity in First-order Non-convex Optimization
Figure 4 for Leveraging Non-uniformity in First-order Non-convex Optimization
Viaarxiv icon