Picture for Jing Zhang

Jing Zhang

The University of Sydney, Australia

Few Tokens Matter: Entropy Guided Attacks on Vision-Language Models

Add code
Dec 26, 2025
Viaarxiv icon

Degradation-Aware Metric Prompting for Hyperspectral Image Restoration

Add code
Dec 23, 2025
Viaarxiv icon

SARMAE: Masked Autoencoder for SAR Representation Learning

Add code
Dec 18, 2025
Viaarxiv icon

Reducing Pilots in Channel Estimation With Predictive Foundation Models

Add code
Dec 17, 2025
Viaarxiv icon

AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path

Add code
Dec 15, 2025
Viaarxiv icon

DCMM-Transformer: Degree-Corrected Mixed-Membership Attention for Medical Imaging

Add code
Nov 15, 2025
Viaarxiv icon

Residual Diffusion Bridge Model for Image Restoration

Add code
Oct 27, 2025
Viaarxiv icon

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Add code
Oct 16, 2025
Viaarxiv icon

Next-Generation AI-Native Wireless Communications: MCMC-Based Receiver Architectures for Unified Processing

Add code
Oct 02, 2025
Viaarxiv icon

Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting

Add code
Oct 02, 2025
Figure 1 for Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Figure 2 for Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Figure 3 for Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Figure 4 for Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Viaarxiv icon