Picture for Seongyun Lee

Seongyun Lee

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Add code
Jun 09, 2024
Figure 1 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 2 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 3 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 4 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Viaarxiv icon

Aligning to Thousands of Preferences via System Message Generalization

Add code
May 28, 2024
Viaarxiv icon

LG AI Research & KAIST at EHRSQL 2024: Self-Training Large Language Models with Pseudo-Labeled Unanswerable Questions for a Reliable Text-to-SQL System on EHRs

Add code
May 18, 2024
Viaarxiv icon

Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation

Add code
Jan 12, 2024
Viaarxiv icon

Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision

Add code
Nov 14, 2023
Viaarxiv icon

Zero-Shot Dense Video Captioning by Jointly Optimizing Text and Moment

Add code
Jul 11, 2023
Viaarxiv icon

LIQUID: A Framework for List Question Answering Dataset Generation

Add code
Feb 06, 2023
Viaarxiv icon