Picture for Xingdi Yuan

Xingdi Yuan

Policy Improvement using Language Feedback Models

Add code
Feb 25, 2024
Figure 1 for Policy Improvement using Language Feedback Models
Figure 2 for Policy Improvement using Language Feedback Models
Figure 3 for Policy Improvement using Language Feedback Models
Figure 4 for Policy Improvement using Language Feedback Models
Viaarxiv icon

V-STaR: Training Verifiers for Self-Taught Reasoners

Add code
Feb 09, 2024
Figure 1 for V-STaR: Training Verifiers for Self-Taught Reasoners
Figure 2 for V-STaR: Training Verifiers for Self-Taught Reasoners
Figure 3 for V-STaR: Training Verifiers for Self-Taught Reasoners
Figure 4 for V-STaR: Training Verifiers for Self-Taught Reasoners
Viaarxiv icon

Guiding Language Model Reasoning with Planning Tokens

Add code
Oct 09, 2023
Figure 1 for Guiding Language Model Reasoning with Planning Tokens
Figure 2 for Guiding Language Model Reasoning with Planning Tokens
Figure 3 for Guiding Language Model Reasoning with Planning Tokens
Figure 4 for Guiding Language Model Reasoning with Planning Tokens
Viaarxiv icon

Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference

Add code
Jun 21, 2023
Figure 1 for Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference
Figure 2 for Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference
Figure 3 for Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference
Figure 4 for Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference
Viaarxiv icon

Augmenting Autotelic Agents with Large Language Models

Add code
May 21, 2023
Viaarxiv icon

It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance

Add code
May 15, 2023
Figure 1 for It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance
Figure 2 for It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance
Figure 3 for It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance
Figure 4 for It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance
Viaarxiv icon

Supporting Qualitative Analysis with Large Language Models: Combining Codebook with GPT-3 for Deductive Coding

Add code
Apr 17, 2023
Viaarxiv icon

A Song of Ice and Fire: Analyzing Textual Autotelic Agents in ScienceWorld

Add code
Feb 24, 2023
Figure 1 for A Song of Ice and Fire: Analyzing Textual Autotelic Agents in ScienceWorld
Figure 2 for A Song of Ice and Fire: Analyzing Textual Autotelic Agents in ScienceWorld
Figure 3 for A Song of Ice and Fire: Analyzing Textual Autotelic Agents in ScienceWorld
Figure 4 for A Song of Ice and Fire: Analyzing Textual Autotelic Agents in ScienceWorld
Viaarxiv icon

GPT-3-driven pedagogical agents for training children's curious question-asking skills

Add code
Dec 08, 2022
Viaarxiv icon

Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation

Add code
Sep 22, 2022
Figure 1 for Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation
Figure 2 for Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation
Figure 3 for Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation
Figure 4 for Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation
Viaarxiv icon