Picture for Jiaheng Zhang

Jiaheng Zhang

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

Add code
Nov 11, 2025
Viaarxiv icon

SWAP: Towards Copyright Auditing of Soft Prompts via Sequential Watermarking

Add code
Nov 05, 2025
Viaarxiv icon

TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models

Add code
Jun 15, 2025
Viaarxiv icon

Efficient Reasoning via Chain of Unconscious Thought

Add code
May 26, 2025
Viaarxiv icon

Silent Leaks: Implicit Knowledge Extraction Attack on RAG Systems through Benign Queries

Add code
May 21, 2025
Viaarxiv icon

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Add code
May 16, 2025
Viaarxiv icon

Geneshift: Impact of different scenario shift on Jailbreaking LLM

Add code
Apr 10, 2025
Viaarxiv icon

Efficient Inference for Large Reasoning Models: A Survey

Add code
Mar 29, 2025
Figure 1 for Efficient Inference for Large Reasoning Models: A Survey
Figure 2 for Efficient Inference for Large Reasoning Models: A Survey
Figure 3 for Efficient Inference for Large Reasoning Models: A Survey
Viaarxiv icon

Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models

Add code
Mar 12, 2025
Viaarxiv icon

Navigating the Helpfulness-Truthfulness Trade-Off with Uncertainty-Aware Instruction Fine-Tuning

Add code
Feb 17, 2025
Viaarxiv icon