Dale Schuurmans

University of Alberta

Spectral Ghost in Representation Learning: from Component Analysis to Self-Supervised Learning

Jan 28, 2026
Via arXiv

Universal computation is intrinsic to language model decoding

Jan 12, 2026
Via arXiv

The World Is Bigger! A Computationally-Embedded Perspective on the Big World Hypothesis

Dec 29, 2025
Via arXiv

Spectral Representation-based Reinforcement Learning

Dec 17, 2025
Via arXiv

Rethinking the Global Convergence of Softmax Policy Gradient with Linear Function Approximation

May 06, 2025
Via arXiv

Improving Large Language Model Planning with Action Sequence Similarity

May 02, 2025
Via arXiv

Representation Learning via Non-Contrastive Mutual Information

Apr 23, 2025
Via arXiv

Ordering-based Conditions for Global Convergence of Policy Gradient Methods

Apr 02, 2025
Via arXiv

Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates

Feb 11, 2025
Via arXiv

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Jan 28, 2025
Via arXiv