Dale Schuurmans

University of Alberta

Generative Hierarchical Materials Search

Sep 10, 2024
Via arXiv

Exploring and Benchmarking the Planning Capabilities of Large Language Models

Jun 18, 2024
Via arXiv

Learning Continually by Spectral Regularization

Jun 10, 2024
Via arXiv

Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation

May 31, 2024
Via arXiv

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

May 29, 2024
Via arXiv

Soft Preference Optimization: Aligning Language Models to Expert Distributions

Apr 30, 2024
Via arXiv

Video as the New Language for Real-World Decision Making

Feb 27, 2024
Via arXiv

Stochastic Gradient Succeeds for Bandits

Feb 27, 2024
Via arXiv

Beyond Expectations: Learning with Stochastic Dominance Made Practical

Feb 05, 2024
Via arXiv

Curvature Explains Loss of Plasticity

Nov 30, 2023
Via arXiv