Picture for Yan Zhang

Yan Zhang

Fellow, IEEE

FuXi-Air: Urban Air Quality Forecasting Based on Emission-Meteorology-Pollutant multimodal Machine Learning

Add code
Jun 09, 2025
Viaarxiv icon

UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning

Add code
Jun 08, 2025
Viaarxiv icon

When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Add code
Jun 05, 2025
Viaarxiv icon

VidText: Towards Comprehensive Evaluation for Video Text Understanding

Add code
May 28, 2025
Viaarxiv icon

Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs

Add code
May 28, 2025
Viaarxiv icon

IKMo: Image-Keyframed Motion Generation with Trajectory-Pose Conditioned Motion Diffusion Model

Add code
May 27, 2025
Viaarxiv icon

MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

Add code
May 22, 2025
Viaarxiv icon

Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses

Add code
May 19, 2025
Viaarxiv icon

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Add code
May 07, 2025
Viaarxiv icon

Temporal Attention Evolutional Graph Convolutional Network for Multivariate Time Series Forecasting

Add code
May 01, 2025
Viaarxiv icon