Yoonjoo Lee

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Jun 09, 2024

One vs. Many: Comprehending Accurate Information from Multiple Erroneous and Inconsistent AI Generations

May 09, 2024

PaperWeaver: Enriching Topical Paper Alerts by Contextualizing Recommended Papers with User-collected Papers

Mar 05, 2024

EvalLM: Interactive Evaluation of Large Language Model Prompts on User-Defined Criteria

Sep 24, 2023

LMCanvas: Object-Oriented Interaction to Personalize Large Language Model-Powered Writing Environments

Mar 27, 2023