Picture for Jie Zhang

Jie Zhang

Chongqing Jinshan Science & Technology

State-Dependent Safety Failures in Multi-Turn Language Model Interaction

Add code
Mar 15, 2026
Viaarxiv icon

What Makes VLMs Robust? Towards Reconciling Robustness and Accuracy in Vision-Language Models

Add code
Mar 13, 2026
Viaarxiv icon

TRACE: Structure-Aware Character Encoding for Robust and Generalizable Document Watermarking

Add code
Mar 13, 2026
Viaarxiv icon

Generalized Recognition of Basic Surgical Actions Enables Skill Assessment and Vision-Language-Model-based Surgical Planning

Add code
Mar 13, 2026
Viaarxiv icon

Neural Gate: Mitigating Privacy Risks in LVLMs via Neuron-Level Gradient Gating

Add code
Mar 13, 2026
Viaarxiv icon

INFACT: A Diagnostic Benchmark for Induced Faithfulness and Factuality Hallucinations in Video-LLMs

Add code
Mar 12, 2026
Viaarxiv icon

Recover to Predict: Progressive Retrospective Learning for Variable-Length Trajectory Prediction

Add code
Mar 11, 2026
Viaarxiv icon

SURE: Semi-dense Uncertainty-REfined Feature Matching

Add code
Mar 05, 2026
Viaarxiv icon

Towards Generalized Multimodal Homography Estimation

Add code
Mar 04, 2026
Viaarxiv icon

Cross-Scale Pansharpening via ScaleFormer and the PanScale Benchmark

Add code
Feb 28, 2026
Viaarxiv icon