Picture for Alice Oh

Alice Oh

KAIST

FINEST: Improving LLM Responses to Sensitive Topics Through Fine-Grained Evaluation

Add code
Mar 04, 2026
Viaarxiv icon

A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities

Add code
Mar 03, 2026
Viaarxiv icon

MentalBench: A Benchmark for Evaluating Psychiatric Diagnostic Capability of Large Language Models

Add code
Feb 13, 2026
Viaarxiv icon

Cultural Compass: A Framework for Organizing Societal Norms to Detect Violations in Human-AI Conversations

Add code
Jan 12, 2026
Viaarxiv icon

Solar Open Technical Report

Add code
Jan 11, 2026
Viaarxiv icon

OLA: Output Language Alignment in Code-Switched LLM Interactions

Add code
Jan 07, 2026
Viaarxiv icon

One-Topic-Doesn't-Fit-All: Transcreating Reading Comprehension Test for Personalized Learning

Add code
Nov 12, 2025
Figure 1 for One-Topic-Doesn't-Fit-All: Transcreating Reading Comprehension Test for Personalized Learning
Figure 2 for One-Topic-Doesn't-Fit-All: Transcreating Reading Comprehension Test for Personalized Learning
Figure 3 for One-Topic-Doesn't-Fit-All: Transcreating Reading Comprehension Test for Personalized Learning
Viaarxiv icon

Are they lovers or friends? Evaluating LLMs' Social Reasoning in English and Korean Dialogues

Add code
Oct 21, 2025
Viaarxiv icon

KORMo: Korean Open Reasoning Model for Everyone

Add code
Oct 10, 2025
Figure 1 for KORMo: Korean Open Reasoning Model for Everyone
Figure 2 for KORMo: Korean Open Reasoning Model for Everyone
Figure 3 for KORMo: Korean Open Reasoning Model for Everyone
Figure 4 for KORMo: Korean Open Reasoning Model for Everyone
Viaarxiv icon

Entangled in Representations: Mechanistic Investigation of Cultural Biases in Large Language Models

Add code
Aug 12, 2025
Viaarxiv icon