Picture for Shurong Zheng

Shurong Zheng

GeM-VG: Towards Generalized Multi-image Visual Grounding with Multimodal Large Language Models

Add code
Jan 08, 2026
Viaarxiv icon

Understand, Think, and Answer: Advancing Visual Reasoning with Large Multimodal Models

Add code
May 27, 2025
Viaarxiv icon

Convergence of Continuous Normalizing Flows for Learning Probability Distributions

Add code
Mar 31, 2024
Viaarxiv icon