Picture for Samuel Holt

Samuel Holt

Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search

Add code
Jun 10, 2025
Viaarxiv icon

G-Sim: Generative Simulations with Large Language Models and Gradient-Free Calibration

Add code
Jun 10, 2025
Viaarxiv icon

The AI Imperative: Scaling High-Quality Peer Review in Machine Learning

Add code
Jun 09, 2025
Viaarxiv icon

MuJoCo Playground

Add code
Feb 12, 2025
Viaarxiv icon

Automatically Learning Hybrid Digital Twins of Dynamical Systems

Add code
Oct 31, 2024
Figure 1 for Automatically Learning Hybrid Digital Twins of Dynamical Systems
Figure 2 for Automatically Learning Hybrid Digital Twins of Dynamical Systems
Figure 3 for Automatically Learning Hybrid Digital Twins of Dynamical Systems
Figure 4 for Automatically Learning Hybrid Digital Twins of Dynamical Systems
Viaarxiv icon

Discovering Preference Optimization Algorithms with and for Large Language Models

Add code
Jun 12, 2024
Figure 1 for Discovering Preference Optimization Algorithms with and for Large Language Models
Figure 2 for Discovering Preference Optimization Algorithms with and for Large Language Models
Figure 3 for Discovering Preference Optimization Algorithms with and for Large Language Models
Figure 4 for Discovering Preference Optimization Algorithms with and for Large Language Models
Viaarxiv icon

ODE Discovery for Longitudinal Heterogeneous Treatment Effects Inference

Add code
Mar 16, 2024
Viaarxiv icon

Retrieval-Augmented Thought Process as Sequential Decision Making

Add code
Feb 12, 2024
Figure 1 for Retrieval-Augmented Thought Process as Sequential Decision Making
Figure 2 for Retrieval-Augmented Thought Process as Sequential Decision Making
Figure 3 for Retrieval-Augmented Thought Process as Sequential Decision Making
Figure 4 for Retrieval-Augmented Thought Process as Sequential Decision Making
Viaarxiv icon

Dense Reward for Free in Reinforcement Learning from Human Feedback

Add code
Feb 01, 2024
Figure 1 for Dense Reward for Free in Reinforcement Learning from Human Feedback
Figure 2 for Dense Reward for Free in Reinforcement Learning from Human Feedback
Figure 3 for Dense Reward for Free in Reinforcement Learning from Human Feedback
Figure 4 for Dense Reward for Free in Reinforcement Learning from Human Feedback
Viaarxiv icon

Deep Generative Symbolic Regression

Add code
Dec 30, 2023
Viaarxiv icon