Jiangning Zhang

College of Control Science and Engineering, Zhejiang University, Hangzhou, China

M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding

Jan 13, 2026
Via arXiv

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation

Jan 07, 2026
Via arXiv

FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing

Jan 06, 2026
Via arXiv

UltraLBM-UNet: Ultralight Bidirectional Mamba-based Model for Skin Lesion Segmentation

Dec 25, 2025
Via arXiv

The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection

Dec 23, 2025
Via arXiv

OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing

Dec 16, 2025
Via arXiv

Transform Trained Transformer: Accelerating Naive 4K Video Generation Over 10×

Dec 15, 2025
Via arXiv

Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation

Dec 15, 2025
Via arXiv

RoleRMBench & RoleRM: Towards Reward Modeling for Profile-Based Role Play in Dialogue Systems

Dec 11, 2025
Via arXiv

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Nov 14, 2025
Via arXiv