Picture for Ziyang Chen

Ziyang Chen

VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding

Add code
Aug 09, 2025
Viaarxiv icon

RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents

Add code
Jul 30, 2025
Viaarxiv icon

Libra: Large Chinese-based Safeguard for AI Content

Add code
Jul 29, 2025
Viaarxiv icon

Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation

Add code
May 28, 2025
Viaarxiv icon

VISTA: Enhancing Vision-Text Alignment in MLLMs via Cross-Modal Mutual Information Maximization

Add code
May 19, 2025
Viaarxiv icon

Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding

Add code
Mar 17, 2025
Viaarxiv icon

A Survey of fMRI to Image Reconstruction

Add code
Feb 24, 2025
Viaarxiv icon

From Visuals to Vocabulary: Establishing Equivalence Between Image and Text Token Through Autoregressive Pre-training in MLLMs

Add code
Feb 13, 2025
Viaarxiv icon

Test Time Training for 4D Medical Image Interpolation

Add code
Feb 04, 2025
Figure 1 for Test Time Training for 4D Medical Image Interpolation
Figure 2 for Test Time Training for 4D Medical Image Interpolation
Figure 3 for Test Time Training for 4D Medical Image Interpolation
Figure 4 for Test Time Training for 4D Medical Image Interpolation
Viaarxiv icon

GPS as a Control Signal for Image Generation

Add code
Jan 21, 2025
Viaarxiv icon