Picture for Zijie Xin

Zijie Xin

Fundus-R1: Training a Fundus-Reading MLLM with Knowledge-Aware Reasoning on Public Data

Add code
Apr 09, 2026
Viaarxiv icon

SAVE: Speech-Aware Video Representation Learning for Video-Text Retrieval

Add code
Mar 11, 2026
Viaarxiv icon

Multi-Object Sketch Animation by Scene Decomposition and Motion Planning

Add code
Mar 25, 2025
Viaarxiv icon