Moontae Lee

LG AI Research & KAIST at EHRSQL 2024: Self-Training Large Language Models with Pseudo-Labeled Unanswerable Questions for a Reliable Text-to-SQL System on EHRs

May 18, 2024
Via arXiv

Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense

May 07, 2024
Via arXiv

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

May 02, 2024
Via arXiv

Small Language Models Need Strong Verifiers to Self-Correct Reasoning

Apr 26, 2024
Via arXiv

Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection

Mar 21, 2024
Via arXiv

YTCommentQA: Video Question Answerability in Instructional Videos

Jan 30, 2024
Via arXiv

Projection Regret: Reducing Background Bias for Novelty Detection via Diffusion Models

Dec 05, 2023
Via arXiv

Code Models are Zero-shot Precondition Reasoners

Nov 16, 2023
Via arXiv

From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning

Oct 24, 2023
Via arXiv

Merging Generated and Retrieved Knowledge for Open-Domain QA

Oct 22, 2023
Via arXiv