Picture for Euntae Kim

Euntae Kim

HarDBench: A Benchmark for Draft-Based Co-Authoring Jailbreak Attacks for Safe Human-LLM Collaborative Writing

Add code
Apr 21, 2026
Viaarxiv icon

NOAH: Benchmarking Narrative Prior driven Hallucination and Omission in Video Large Language Models

Add code
Nov 09, 2025
Figure 1 for NOAH: Benchmarking Narrative Prior driven Hallucination and Omission in Video Large Language Models
Figure 2 for NOAH: Benchmarking Narrative Prior driven Hallucination and Omission in Video Large Language Models
Figure 3 for NOAH: Benchmarking Narrative Prior driven Hallucination and Omission in Video Large Language Models
Figure 4 for NOAH: Benchmarking Narrative Prior driven Hallucination and Omission in Video Large Language Models
Viaarxiv icon