Allen Nie

Shammie

POLCA: Stochastic Generative Optimization with LLM

Mar 16, 2026

ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models

Feb 23, 2026

Learning Game-Playing Agents with Generative Code Optimization

Aug 27, 2025

Provably Learning from Language Feedback

Jun 12, 2025

Teaching Large Language Models to Reason through Learning and Forgetting

Apr 15, 2025

Predicting Long Term Sequential Policy Value Using Softer Surrogates

Dec 30, 2024

Improving Parallel Program Performance Through DSL-Driven Code Generation with LLM Optimizers

Oct 21, 2024

EVOLvE: Evaluating and Optimizing LLMs For Exploration

Oct 08, 2024

Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows

Jun 23, 2024

OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators

May 27, 2024