Allen Nie

Shammie

Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows

Jun 23, 2024

OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators

May 27, 2024

The Importance of Directional Feedback for LLM-based Optimizers

May 26, 2024

LLF-Bench: Benchmark for Interactive Learning from Language Feedback

Dec 13, 2023

MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

Oct 31, 2023

Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets

Jun 24, 2023

Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task

Apr 13, 2023

Model-based Offline Reinforcement Learning with Local Misspecification

Jan 26, 2023

Giving Feedback on Interactive Student Programs with Meta-Exploration

Nov 16, 2022

Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data

Oct 16, 2022