Picture for Willie Neiswanger

Willie Neiswanger

Department of Computer Science, Stanford University

Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global Evolution

Add code
Feb 13, 2024
Viaarxiv icon

DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models

Add code
Feb 04, 2024
Figure 1 for DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models
Figure 2 for DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models
Figure 3 for DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models
Figure 4 for DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models
Viaarxiv icon

LLM360: Towards Fully Transparent Open-Source LLMs

Add code
Dec 11, 2023
Figure 1 for LLM360: Towards Fully Transparent Open-Source LLMs
Figure 2 for LLM360: Towards Fully Transparent Open-Source LLMs
Figure 3 for LLM360: Towards Fully Transparent Open-Source LLMs
Figure 4 for LLM360: Towards Fully Transparent Open-Source LLMs
Viaarxiv icon

Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration

Add code
Dec 01, 2023
Viaarxiv icon

Making Scalable Meta Learning Practical

Add code
Oct 23, 2023
Figure 1 for Making Scalable Meta Learning Practical
Figure 2 for Making Scalable Meta Learning Practical
Figure 3 for Making Scalable Meta Learning Practical
Figure 4 for Making Scalable Meta Learning Practical
Viaarxiv icon

SlimPajama-DC: Understanding Data Combinations for LLM Training

Add code
Sep 19, 2023
Figure 1 for SlimPajama-DC: Understanding Data Combinations for LLM Training
Figure 2 for SlimPajama-DC: Understanding Data Combinations for LLM Training
Figure 3 for SlimPajama-DC: Understanding Data Combinations for LLM Training
Figure 4 for SlimPajama-DC: Understanding Data Combinations for LLM Training
Viaarxiv icon

Kernelized Offline Contextual Dueling Bandits

Add code
Jul 21, 2023
Viaarxiv icon

Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching

Add code
Mar 05, 2023
Figure 1 for Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching
Figure 2 for Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching
Figure 3 for Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching
Figure 4 for Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching
Viaarxiv icon

Near-optimal Policy Identification in Active Reinforcement Learning

Add code
Dec 19, 2022
Figure 1 for Near-optimal Policy Identification in Active Reinforcement Learning
Figure 2 for Near-optimal Policy Identification in Active Reinforcement Learning
Figure 3 for Near-optimal Policy Identification in Active Reinforcement Learning
Figure 4 for Near-optimal Policy Identification in Active Reinforcement Learning
Viaarxiv icon

Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis

Add code
Oct 10, 2022
Figure 1 for Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Figure 2 for Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Figure 3 for Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Figure 4 for Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Viaarxiv icon