Picture for Lijie Hu

Lijie Hu

UGID: Unified Graph Isomorphism for Debiasing Large Language Models

Add code
Mar 19, 2026
Viaarxiv icon

Functional Subspace Watermarking for Large Language Models

Add code
Mar 19, 2026
Viaarxiv icon

FaithSteer-BENCH: A Deployment-Aligned Stress-Testing Benchmark for Inference-Time Steering

Add code
Mar 18, 2026
Viaarxiv icon

SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?

Add code
Mar 16, 2026
Viaarxiv icon

Global Evolutionary Steering: Refining Activation Steering Control via Cross-Layer Consistency

Add code
Mar 12, 2026
Viaarxiv icon

Word Recovery in Large Language Models Enables Character-Level Tokenization Robustness

Add code
Mar 11, 2026
Viaarxiv icon

Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability

Add code
Mar 11, 2026
Viaarxiv icon

Visual Self-Fulfilling Alignment: Shaping Safety-Oriented Personas via Threat-Related Images

Add code
Mar 09, 2026
Viaarxiv icon

Understanding the Dynamics of Demonstration Conflict in In-Context Learning

Add code
Mar 03, 2026
Viaarxiv icon

Predicting LLM Output Length via Entropy-Guided Representations

Add code
Feb 12, 2026
Viaarxiv icon