Jiangning Zhang

College of Control Science and Engineering, Zhejiang University, Hangzhou, China

OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing

Dec 16, 2025

Transform Trained Transformer: Accelerating Naive 4K Video Generation Over 10$\times$

Dec 15, 2025

Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation

Dec 15, 2025

RoleRMBench & RoleRM: Towards Reward Modeling for Profile-Based Role Play in Dialogue Systems

Dec 11, 2025

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Nov 14, 2025

EfficientIML: Efficient High-Resolution Image Manipulation Localization

Sep 10, 2025

Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning

Jul 02, 2025

Omni-AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented for Efficient Long Video Understanding

Jun 16, 2025

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions

Jun 16, 2025

PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement

Jun 09, 2025