Seungyeon Jwa

ParaPairAudioBench: Paralinguistic Pairwise Audio Benchmark for LALM-as-a-Judge

Jun 23, 2026
Via arXiv

RTSGameBench: An RTS Benchmark for Strategic Reasoning by Vision-Language Models

Jun 18, 2026
Via arXiv

OffsetBias: Leveraging Debiased Data for Tuning Evaluators

Jul 09, 2024
Via arXiv