Picture for Martha White

Martha White

Measure-to-measure Regression with Transformers

Add code
May 27, 2026
Viaarxiv icon

Addressing Terminal Constraints in Data-Driven Demand Response Scheduling

Add code
May 14, 2026
Viaarxiv icon

Revisiting Mixture Policies in Entropy-Regularized Actor-Critic

Add code
May 09, 2026
Viaarxiv icon

Regularized Latent Dynamics Prediction is a Strong Baseline For Behavioral Foundation Models

Add code
Mar 16, 2026
Viaarxiv icon

Gradient Iterated Temporal-Difference Learning

Add code
Mar 08, 2026
Viaarxiv icon

Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning

Add code
Feb 12, 2026
Viaarxiv icon

Fine-Tuning without Performance Degradation

Add code
May 01, 2025
Figure 1 for Fine-Tuning without Performance Degradation
Figure 2 for Fine-Tuning without Performance Degradation
Figure 3 for Fine-Tuning without Performance Degradation
Figure 4 for Fine-Tuning without Performance Degradation
Viaarxiv icon

Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers

Add code
Nov 22, 2024
Viaarxiv icon

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning

Add code
Sep 02, 2024
Figure 1 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Figure 2 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Figure 3 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Figure 4 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Viaarxiv icon

q-exponential family for policy optimization

Add code
Aug 14, 2024
Viaarxiv icon