Picture for Yo-Sub Han

Yo-Sub Han

Linguistics-Aware Non-Distortionary LLM Watermarking

Add code
May 30, 2026
Viaarxiv icon

EPIC: Efficient and Parallel Inference under CFG Constraints for Diffusion Language Models

Add code
May 30, 2026
Viaarxiv icon

DLM-SWAI: Steering Diffusion Language Models Before They Unmask

Add code
May 28, 2026
Viaarxiv icon

STAB: Specification-driven Testing for Algorithmic Bottlenecks

Add code
May 27, 2026
Viaarxiv icon

Adaptive Steering and Remasking for Safe Generation in Diffusion Language Models

Add code
May 13, 2026
Viaarxiv icon

NCO: A Versatile Plug-in for Handling Negative Constraints in Decoding

Add code
May 11, 2026
Viaarxiv icon

Cross-Family Universality of Behavioral Axes via Anchor-Projected Representations

Add code
May 11, 2026
Viaarxiv icon

CRaFT: Circuit-Guided Refusal Feature Selection via Cross-Layer Transcoders

Add code
Apr 02, 2026
Viaarxiv icon

Steering Language Models Before They Speak: Logit-Level Interventions

Add code
Jan 16, 2026
Viaarxiv icon

How Does the Thinking Step Influence Model Safety? An Entropy-based Safety Reminder for LRMs

Add code
Jan 07, 2026
Viaarxiv icon