Picture for Silviu Pitis

Silviu Pitis

Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries

Add code
Sep 01, 2024
Figure 1 for Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries
Figure 2 for Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries
Figure 3 for Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries
Figure 4 for Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries
Viaarxiv icon

Improving Context-Aware Preference Modeling for Language Models

Add code
Jul 20, 2024
Viaarxiv icon

Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards

Add code
Sep 30, 2023
Viaarxiv icon

Identifying the Risks of LM Agents with an LM-Emulated Sandbox

Add code
Sep 25, 2023
Viaarxiv icon

Boosted Prompt Ensembles for Large Language Models

Add code
Apr 12, 2023
Viaarxiv icon

Large Language Models Are Human-Level Prompt Engineers

Add code
Nov 03, 2022
Viaarxiv icon

MoCoDA: Model-based Counterfactual Data Augmentation

Add code
Oct 20, 2022
Figure 1 for MoCoDA: Model-based Counterfactual Data Augmentation
Figure 2 for MoCoDA: Model-based Counterfactual Data Augmentation
Figure 3 for MoCoDA: Model-based Counterfactual Data Augmentation
Figure 4 for MoCoDA: Model-based Counterfactual Data Augmentation
Viaarxiv icon

Counterfactual Data Augmentation using Locally Factored Dynamics

Add code
Jul 06, 2020
Figure 1 for Counterfactual Data Augmentation using Locally Factored Dynamics
Figure 2 for Counterfactual Data Augmentation using Locally Factored Dynamics
Figure 3 for Counterfactual Data Augmentation using Locally Factored Dynamics
Figure 4 for Counterfactual Data Augmentation using Locally Factored Dynamics
Viaarxiv icon

Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning

Add code
Jul 06, 2020
Figure 1 for Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
Figure 2 for Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
Figure 3 for Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
Figure 4 for Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
Viaarxiv icon

An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality

Add code
Feb 14, 2020
Figure 1 for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality
Figure 2 for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality
Figure 3 for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality
Figure 4 for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality
Viaarxiv icon