Picture for Jiangning Zhang

Jiangning Zhang

Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning

Add code
Jul 02, 2025
Viaarxiv icon

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions

Add code
Jun 16, 2025
Viaarxiv icon

Omni-AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented for Efficient Long Video Understanding

Add code
Jun 16, 2025
Viaarxiv icon

PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement

Add code
Jun 09, 2025
Viaarxiv icon

Align and Surpass Human Camouflaged Perception: Visual Refocus Reinforcement Fine-Tuning

Add code
May 26, 2025
Viaarxiv icon

So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection

Add code
May 24, 2025
Viaarxiv icon

Swin DiT: Diffusion Transformer using Pseudo Shifted Windows

Add code
May 19, 2025
Viaarxiv icon

Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection

Add code
Apr 19, 2025
Viaarxiv icon

Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer

Add code
Mar 21, 2025
Viaarxiv icon

Image Inversion: A Survey from GANs to Diffusion and Beyond

Add code
Feb 17, 2025
Viaarxiv icon