JinYeong Bak

Unintended Harms of Value-Aligned LLMs: Psychological and Empirical Insights

Jun 06, 2025
Via arXiv

Research on Superalignment Should Advance Now with Parallel Optimization of Competence and Conformity

Mar 08, 2025
Via arXiv

Beyond Turn-taking: Introducing Text-based Overlap into Human-LLM Interactions

Jan 30, 2025
Via arXiv

The Road to Artificial SuperIntelligence: A Comprehensive Survey of Superalignment

Dec 24, 2024
Via arXiv

Self-Training Meets Consistency: Improving LLMs' Reasoning With Consistency-Driven Rationale Evaluation

Nov 22, 2024
Via arXiv

Perturb-and-Compare Approach for Detecting Out-of-Distribution Samples in Constrained Access Environments

Aug 19, 2024
Via arXiv

KpopMT: Translation Dataset with Terminology for Kpop Fandom

Jul 10, 2024
Via arXiv

MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control

Jul 03, 2024
Via arXiv

PEMA: Plug-in External Memory Adaptation for Language Models

Nov 14, 2023
Via arXiv

From Values to Opinions: Predicting Human Behaviors and Stances Using Value-Injected Large Language Models

Oct 27, 2023
Via arXiv