Picture for Juhyun Oh

Juhyun Oh

OLA: Output Language Alignment in Code-Switched LLM Interactions

Add code
Jan 07, 2026
Viaarxiv icon

Are they lovers or friends? Evaluating LLMs' Social Reasoning in English and Korean Dialogues

Add code
Oct 21, 2025
Viaarxiv icon

Spotting Out-of-Character Behavior: Atomic-Level Evaluation of Persona Fidelity in Open-Ended Generation

Add code
Jun 24, 2025
Viaarxiv icon

Flex-TravelPlanner: A Benchmark for Flexible Planning with Language Agents

Add code
Jun 05, 2025
Viaarxiv icon

Uncovering Factor Level Preferences to Improve Human-Model Alignment

Add code
Oct 09, 2024
Figure 1 for Uncovering Factor Level Preferences to Improve Human-Model Alignment
Figure 2 for Uncovering Factor Level Preferences to Improve Human-Model Alignment
Figure 3 for Uncovering Factor Level Preferences to Improve Human-Model Alignment
Figure 4 for Uncovering Factor Level Preferences to Improve Human-Model Alignment
Viaarxiv icon

Multi-FAct: Assessing Multilingual LLMs' Multi-Regional Knowledge using FActScore

Add code
Mar 01, 2024
Viaarxiv icon

The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate

Add code
Feb 09, 2024
Figure 1 for The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate
Figure 2 for The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate
Figure 3 for The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate
Figure 4 for The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate
Viaarxiv icon

KOLD: Korean Offensive Language Dataset

Add code
May 23, 2022
Figure 1 for KOLD: Korean Offensive Language Dataset
Figure 2 for KOLD: Korean Offensive Language Dataset
Figure 3 for KOLD: Korean Offensive Language Dataset
Figure 4 for KOLD: Korean Offensive Language Dataset
Viaarxiv icon

KLUE: Korean Language Understanding Evaluation

Add code
Jun 11, 2021
Figure 1 for KLUE: Korean Language Understanding Evaluation
Figure 2 for KLUE: Korean Language Understanding Evaluation
Figure 3 for KLUE: Korean Language Understanding Evaluation
Figure 4 for KLUE: Korean Language Understanding Evaluation
Viaarxiv icon