Kyungjae Lee

Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection

Mar 21, 2024
Kyungjae Lee, Dasol Hwang, Sunghyun Park, Youngsoo Jang, Moontae Lee

Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion

Oct 27, 2023
Taehyun Cho, Seungyub Han, Heesoo Lee, Kyungjae Lee, Jungwoo Lee

PreWoMe: Exploiting Presuppositions as Working Memory for Long Form Question Answering

Oct 24, 2023
Wookje Han, Jinsol Park, Kyungjae Lee

SPOTS: Stable Placement of Objects with Reasoning in Semi-Autonomous Teleoperation Systems

Sep 25, 2023
Joonhyung Lee, Sangbeom Park, Jeongeun Park, Kyungjae Lee, Sungjoon Choi

On Monotonic Aggregation for Open-domain QA

Aug 08, 2023
Sang-eun Han, Yeonseok Jeong, Seung-won Hwang, Kyungjae Lee

When to Read Documents or QA History: On Unified and Selective Open-domain QA

Jun 07, 2023
Kyungjae Lee, Sang-eun Han, Seung-won Hwang, Moontae Lee

Revisiting Dense Retrieval with Unanswerable Counterfactuals

Apr 12, 2023
Yongho Song, Dahyun Lee, Kyungjae Lee, Jinyoung Yeo

Exploring the Benefits of Training Expert Language Models over Instruction Tuning

Feb 09, 2023
Joel Jang, Seungone Kim, Seonghyeon Ye, Doyoung Kim, Lajanugen Logeswaran, Moontae Lee, Kyungjae Lee, Minjoon Seo
