Picture for Wenpeng Xing

Wenpeng Xing

Silencing the Guardrails: Inference-Time Jailbreaking via Dynamic Contextual Representation Ablation

Add code
Apr 09, 2026
Viaarxiv icon

LatentAudit: Real-Time White-Box Faithfulness Monitoring for Retrieval-Augmented Generation with Verifiable Deployment

Add code
Apr 07, 2026
Viaarxiv icon

From Retinal Evidence to Safe Decisions: RETINA-SAFE and ECRT for Hallucination Risk Triage in Medical LLMs

Add code
Apr 07, 2026
Viaarxiv icon

ICPO: Illocution-Calibrated Policy Optimization for Multi-Turn Conversation

Add code
Jan 20, 2026
Viaarxiv icon

ForgetMark: Stealthy Fingerprint Embedding via Targeted Unlearning in Language Models

Add code
Jan 13, 2026
Viaarxiv icon

DIAP: A Decentralized Agent Identity Protocol with Zero-Knowledge Proofs and a Hybrid P2P Stack

Add code
Nov 06, 2025
Viaarxiv icon

HGMF: A Hierarchical Gaussian Mixture Framework for Scalable Tool Invocation within the Model Context Protocol

Add code
Aug 11, 2025
Viaarxiv icon

UW-3DGS: Underwater 3D Reconstruction with Physics-Aware Gaussian Splatting

Add code
Aug 08, 2025
Viaarxiv icon

MEraser: An Effective Fingerprint Erasure Approach for Large Language Models

Add code
Jun 14, 2025
Viaarxiv icon

NeuRel-Attack: Neuron Relearning for Safety Disalignment in Large Language Models

Add code
Apr 29, 2025
Viaarxiv icon