Picture for Dale Schuurmans

Dale Schuurmans

University of Alberta

Least-to-Most Prompting Enables Complex Reasoning in Large Language Models

Add code
May 21, 2022
Figure 1 for Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
Figure 2 for Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
Figure 3 for Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
Figure 4 for Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
Viaarxiv icon

Reinforcement Teaching

Add code
Apr 25, 2022
Figure 1 for Reinforcement Teaching
Figure 2 for Reinforcement Teaching
Figure 3 for Reinforcement Teaching
Figure 4 for Reinforcement Teaching
Viaarxiv icon

Self-Consistency Improves Chain of Thought Reasoning in Language Models

Add code
Apr 06, 2022
Figure 1 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Figure 2 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Figure 3 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Figure 4 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Viaarxiv icon

Chain of Thought Prompting Elicits Reasoning in Large Language Models

Add code
Jan 28, 2022
Figure 1 for Chain of Thought Prompting Elicits Reasoning in Large Language Models
Figure 2 for Chain of Thought Prompting Elicits Reasoning in Large Language Models
Figure 3 for Chain of Thought Prompting Elicits Reasoning in Large Language Models
Figure 4 for Chain of Thought Prompting Elicits Reasoning in Large Language Models
Viaarxiv icon

Neural Stochastic Dual Dynamic Programming

Add code
Dec 01, 2021
Figure 1 for Neural Stochastic Dual Dynamic Programming
Figure 2 for Neural Stochastic Dual Dynamic Programming
Figure 3 for Neural Stochastic Dual Dynamic Programming
Figure 4 for Neural Stochastic Dual Dynamic Programming
Viaarxiv icon

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs

Add code
Nov 01, 2021
Figure 1 for SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs
Figure 2 for SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs
Figure 3 for SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs
Figure 4 for SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs
Viaarxiv icon

Understanding the Effect of Stochasticity in Policy Optimization

Add code
Oct 29, 2021
Figure 1 for Understanding the Effect of Stochasticity in Policy Optimization
Figure 2 for Understanding the Effect of Stochasticity in Policy Optimization
Figure 3 for Understanding the Effect of Stochasticity in Policy Optimization
Viaarxiv icon

Combiner: Full Attention Transformer with Sparse Computation Cost

Add code
Jul 12, 2021
Figure 1 for Combiner: Full Attention Transformer with Sparse Computation Cost
Figure 2 for Combiner: Full Attention Transformer with Sparse Computation Cost
Figure 3 for Combiner: Full Attention Transformer with Sparse Computation Cost
Figure 4 for Combiner: Full Attention Transformer with Sparse Computation Cost
Viaarxiv icon

On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data

Add code
Jun 18, 2021
Figure 1 for On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data
Figure 2 for On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data
Viaarxiv icon

Characterizing the Gap Between Actor-Critic and Policy Gradient

Add code
Jun 13, 2021
Figure 1 for Characterizing the Gap Between Actor-Critic and Policy Gradient
Figure 2 for Characterizing the Gap Between Actor-Critic and Policy Gradient
Figure 3 for Characterizing the Gap Between Actor-Critic and Policy Gradient
Figure 4 for Characterizing the Gap Between Actor-Critic and Policy Gradient
Viaarxiv icon