Picture for Meng Han

Meng Han

Silencing the Guardrails: Inference-Time Jailbreaking via Dynamic Contextual Representation Ablation

Add code
Apr 09, 2026
Viaarxiv icon

From Retinal Evidence to Safe Decisions: RETINA-SAFE and ECRT for Hallucination Risk Triage in Medical LLMs

Add code
Apr 07, 2026
Viaarxiv icon

AttnDiff: Attention-based Differential Fingerprinting for Large Language Models

Add code
Apr 07, 2026
Viaarxiv icon

LatentAudit: Real-Time White-Box Faithfulness Monitoring for Retrieval-Augmented Generation with Verifiable Deployment

Add code
Apr 07, 2026
Viaarxiv icon

MO-RiskVAE: A Multi-Omics Variational Autoencoder for Survival Risk Modeling in Multiple MyelomaMO-RiskVAE

Add code
Apr 07, 2026
Viaarxiv icon

Policy of Thoughts: Scaling LLM Reasoning via Test-time Policy Evolution

Add code
Jan 28, 2026
Viaarxiv icon

MalURLBench: A Benchmark Evaluating Agents' Vulnerabilities When Processing Web URLs

Add code
Jan 26, 2026
Viaarxiv icon

ICPO: Illocution-Calibrated Policy Optimization for Multi-Turn Conversation

Add code
Jan 20, 2026
Viaarxiv icon

AdaMARP: An Adaptive Multi-Agent Interaction Framework for General Immersive Role-Playing

Add code
Jan 16, 2026
Viaarxiv icon

SME-YOLO: A Real-Time Detector for Tiny Defect Detection on PCB Surfaces

Add code
Jan 16, 2026
Viaarxiv icon