Yongdong Zhang

See More, Match Better: Multi-Source Feature Fusion for Two-View Correspondence Learning

Jun 08, 2026

Towards Accurate Emotion-Attributed Video Captioning via Fine-grained Emotion-Cause Pair Extraction

Jun 07, 2026

CyberJurors: A Multi-Agent Simulation Task for E-Commerce Disputes Verdict

May 27, 2026

Lance: Unified Multimodal Modeling by Multi-Task Synergy

May 20, 2026

Uncertainty-Aware Exploratory Direct Preference Optimization for Multimodal Large Language Models

May 06, 2026

Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation

May 05, 2026

CreatiParser: Generative Image Parsing of Raster Graphic Designs into Editable Layers

Apr 21, 2026

EMS: Multi-Agent Voting via Efficient Majority-then-Stopping

Apr 03, 2026

Scale over Preference: The Impact of AI-Generated Content on Online Content Ecology

Apr 02, 2026

FACE-net: Factual Calibration and Emotion Augmentation for Retrieval-enhanced Emotional Video Captioning

Mar 18, 2026