Sumin Yu

PopResume: Causal Fairness Evaluation of LLM/VLM Resume Screeners with Population-Representative Dataset

Mar 24, 2026

Sumin Yu, Juhyeon Park, Taesup Moon

Abstract: We present PopResume, a population-representative resume dataset for causal fairness auditing of LLM- and VLM-based resume screening systems. Unlike existing benchmarks that rely on manually injected demographic information and outcome-level disparities, PopResume is grounded in population statistics and preserves natural attribute relationships, enabling path-specific effect (PSE)-based fairness evaluation. We decompose the effect of a protected attribute on resume scores into two paths: the business necessity path, mediated by job-relevant qualifications, and the redlining path, mediated by demographic proxies. This distinction allows auditors to separate legally permissible from impermissible sources of disparity. Evaluating four LLMs and four VLMs on PopResume's 60.8K resumes across five occupations, we identify five representative discrimination patterns that aggregate metrics fail to capture. Our results demonstrate that PSE-based evaluation reveals fairness issues masked by outcome-level measures, underscoring the need for causally grounded auditing frameworks in AI-assisted hiring.

* Under Review
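
The two-path decomposition described in the abstract can be illustrated with a toy example. The sketch below is not the paper's estimation procedure: it assumes a hypothetical linear structural model in which a binary protected attribute affects a qualification mediator and a proxy mediator, and a stand-in linear `score` function plays the role of the screener. All names and coefficients (`a_q`, `a_p`, `b_q`, `b_p`, `simulate_pse`) are made up for illustration.

```python
import random


def simulate_pse(n=10000, seed=0, a_q=0.5, a_p=1.0, b_q=2.0, b_p=0.3):
    """Estimate path-specific effects in a toy linear model (illustrative only).

    Assumed structure: A -> Q -> S (business necessity path)
                       A -> P -> S (redlining path)
    """
    rng = random.Random(seed)

    def qualification(a, u):
        # Job-relevant mediator, shifted by the protected attribute a.
        return a_q * a + u

    def proxy(a, u):
        # Demographic-proxy mediator, shifted by the protected attribute a.
        return a_p * a + u

    def score(q, p):
        # Hypothetical stand-in for a resume screener's score.
        return b_q * q + b_p * p

    bn = rl = total = 0.0
    for _ in range(n):
        # Shared noise terms keep each counterfactual comparison within one unit.
        u_q, u_p = rng.gauss(0, 1), rng.gauss(0, 1)
        base = score(qualification(0, u_q), proxy(0, u_p))
        # Switch the attribute along one path only, holding the other at baseline.
        bn += score(qualification(1, u_q), proxy(0, u_p)) - base
        rl += score(qualification(0, u_q), proxy(1, u_p)) - base
        # Switch the attribute along both paths at once.
        total += score(qualification(1, u_q), proxy(1, u_p)) - base

    return {
        "business_necessity": bn / n,
        "redlining": rl / n,
        "total": total / n,
    }


if __name__ == "__main__":
    print(simulate_pse())
```

In this linear toy model the two path effects add up to the total effect, so an aggregate disparity of the same size could come from either path. That is the kind of ambiguity a path-level audit is meant to resolve.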

SP-Guard: Selective Prompt-adaptive Guidance for Safe Text-to-Image Generation

Nov 14, 2025

Sumin Yu, Taesup Moon

Abstract: While diffusion-based T2I models have achieved remarkable image generation quality, they also enable easy creation of harmful content, raising social concerns and highlighting the need for safer generation. Existing inference-time guiding methods lack both adaptivity (adjusting guidance strength based on the prompt) and selectivity (targeting only unsafe regions of the image). Our method, SP-Guard, addresses these limitations by estimating prompt harmfulness and applying a selective guidance mask to guide only unsafe areas. Experiments show that SP-Guard generates safer images than existing methods while minimizing unintended content alteration. Beyond improving safety, our findings highlight the importance of transparency and controllability in image generation.

* Accepted for presentation at TRUST-AI Workshop, ECAI 2025. Proceedings to appear in CEUR-WS
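
The two properties named in the abstract, adaptivity and selectivity, can be sketched in a few lines. This is a generic illustration of prompt-scaled, masked guidance, not SP-Guard's actual formulation. The function name, the threshold-based mask, and the parameters `harm_score`, `threshold`, and `max_scale` are all hypothetical, and `eps_safety` is assumed to be a per-location estimate of the direction toward the unsafe concept.

```python
def selective_guidance(eps_cond, eps_safety, harm_score,
                       threshold=0.5, max_scale=5.0):
    """Apply masked, prompt-scaled safety guidance to a 2D noise estimate.

    eps_cond   : 2D list, prompt-conditioned noise prediction (toy values).
    eps_safety : 2D list, assumed direction toward the unsafe concept.
    harm_score : float in [0, 1], assumed estimate of prompt harmfulness.
    """
    # Adaptivity: guidance strength grows with the estimated prompt harmfulness.
    scale = max_scale * harm_score
    guided = []
    for row_cond, row_safety in zip(eps_cond, eps_safety):
        out_row = []
        for c, s in zip(row_cond, row_safety):
            # Selectivity: only locations with a strong unsafe signal are guided.
            mask = 1.0 if abs(s) > threshold else 0.0
            out_row.append(c - scale * mask * s)
        guided.append(out_row)
    return guided


if __name__ == "__main__":
    eps_cond = [[1.0, 1.0]]
    eps_safety = [[0.1, 1.0]]
    # A benign prompt (harm_score 0) leaves the prediction untouched.
    print(selective_guidance(eps_cond, eps_safety, harm_score=0.0))
    # A harmful prompt changes only the location above the mask threshold.
    print(selective_guidance(eps_cond, eps_safety, harm_score=1.0))
```

Under these assumptions, a zero harm score returns the input unchanged, and a nonzero score alters only the masked locations. This mirrors the stated goal of minimizing unintended content alteration.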
