Sang Seo

An Evaluation of Data Leakage Risks in Tool-Using LLM Agents in Realistic Scenarios

Jun 15, 2026
Via arXiv

XL-SafetyBench: A Country-Grounded Cross-Cultural Benchmark for LLM Safety and Cultural Sensitivity

May 07, 2026
Via arXiv

Improving Methodologies for LLM Evaluations Across Global Languages

Jan 22, 2026
Via arXiv

Improving Methodologies for Agentic Evaluations Across Domains: Leakage of Sensitive Information, Fraud and Cybersecurity Threats

Jan 22, 2026
Via arXiv