Zikang Ding

Functional Subspace Watermarking for Large Language Models

Mar 19, 2026, via arXiv

UGID: Unified Graph Isomorphism for Debiasing Large Language Models

Mar 19, 2026, via arXiv

FaithSteer-BENCH: A Deployment-Aligned Stress-Testing Benchmark for Inference-Time Steering

Mar 18, 2026, via arXiv

Delayed Backdoor Attacks: Exploring the Temporal Dimension as a New Attack Surface in Pre-Trained Models

Mar 12, 2026, via arXiv