Lex Weaver

Reinforcement Learning From State and Temporal Differences

Dec 23, 2025
Via arXiv

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

Jan 10, 2013
Via arXiv

KnightCap: A chess program that learns by combining TD with game-tree search

Jan 10, 1999
Via arXiv

TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search

Jan 05, 1999
Via arXiv

Evolution of Neural Networks to Play the Game of Dots-and-Boxes

Sep 28, 1998
Via arXiv