Dezhang Kong

Policy of Thoughts: Scaling LLM Reasoning via Test-time Policy Evolution

Jan 28, 2026 (via arXiv)

MalURLBench: A Benchmark Evaluating Agents' Vulnerabilities When Processing Web URLs

Jan 26, 2026 (via arXiv)

ICPO: Illocution-Calibrated Policy Optimization for Multi-Turn Conversation

Jan 20, 2026 (via arXiv)

DNF: Dual-Layer Nested Fingerprinting for Large Language Model Intellectual Property Protection

Jan 13, 2026 (via arXiv)

Scalpel-SAM: A Semi-Supervised Paradigm for Adapting SAM to Infrared Small Object Detection

Dec 27, 2025 (via arXiv)

NeuRel-Attack: Neuron Relearning for Safety Disalignment in Large Language Models

Apr 29, 2025 (via arXiv)