Picture for Wenqi Zhang

Wenqi Zhang

PragLocker: Protecting Agent Intellectual Property in Untrusted Deployments via Non-Portable Prompts

Add code
May 07, 2026
Viaarxiv icon

Towards Steering without Sacrifice: Principled Training of Steering Vectors for Prompt-only Interventions

Add code
May 07, 2026
Viaarxiv icon

Pause or Fabricate? Training Language Models for Grounded Reasoning

Add code
Apr 21, 2026
Viaarxiv icon

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

Add code
Apr 15, 2026
Viaarxiv icon

UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization

Add code
Apr 15, 2026
Viaarxiv icon

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Add code
Apr 09, 2026
Viaarxiv icon

SARE: Sample-wise Adaptive Reasoning for Training-free Fine-grained Visual Recognition

Add code
Mar 18, 2026
Viaarxiv icon

Reinforcement Fine-Tuning for History-Aware Dense Retriever in RAG

Add code
Feb 03, 2026
Viaarxiv icon

MirrorGuard: Toward Secure Computer-Use Agents via Simulation-to-Real Reasoning Correction

Add code
Jan 19, 2026
Viaarxiv icon

Clustering-Based User Selection in Federated Learning: Metadata Exploitation for 3GPP Networks

Add code
Jan 15, 2026
Viaarxiv icon