
Allen Nie

Shammie

LLF-Bench: Benchmark for Interactive Learning from Language Feedback

Dec 13, 2023
Via arXiv

MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

Oct 31, 2023
Via arXiv

Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets

Jun 24, 2023
Via arXiv

Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task

Apr 13, 2023
Via arXiv

Model-based Offline Reinforcement Learning with Local Misspecification

Jan 26, 2023
Via arXiv

Giving Feedback on Interactive Student Programs with Meta-Exploration

Nov 16, 2022
Via arXiv

Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data

Oct 16, 2022
Via arXiv

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Jun 10, 2022
Via arXiv

Play to Grade: Testing Coding Games as Classifying Markov Decision Process

Oct 27, 2021
Via arXiv

On the Opportunities and Risks of Foundation Models

Aug 18, 2021
Via arXiv