Yan Zhang

Fellow, IEEE

How Far Are Video Models from True Multimodal Reasoning?

Apr 21, 2026
Via arXiv

Value-Guidance MeanFlow for Offline Multi-Agent Reinforcement Learning

Apr 09, 2026
Via arXiv

Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos

Mar 18, 2026
Via arXiv

GPT4o-Receipt: A Dataset and Human Study for AI-Generated Document Forensics

Mar 12, 2026
Via arXiv

Bilevel Layer-Positioning LoRA for Real Image Dehazing

Mar 11, 2026
Via arXiv

Automated Thematic Analysis for Clinical Qualitative Data: Iterative Codebook Refinement with Full Provenance

Mar 09, 2026
Via arXiv

DOCFORGE-BENCH: A Comprehensive Benchmark for Document Forgery Detection and Analysis

Mar 02, 2026
Via arXiv

SSKG Hub: An Expert-Guided Platform for LLM-Empowered Sustainability Standards Knowledge Graphs

Feb 28, 2026
Via arXiv

Wavelet-based Frame Selection by Detecting Semantic Boundary for Long Video Understanding

Feb 28, 2026
Via arXiv

UniVBench: Towards Unified Evaluation for Video Foundation Models

Feb 25, 2026
Via arXiv