Picture for Bowen Liu

Bowen Liu

MedHorizon: Towards Long-context Medical Video Understanding in the Wild

Add code
May 07, 2026
Viaarxiv icon

Divide-then-Diagnose: Weaving Clinician-Inspired Contexts for Ultra-Long Capsule Endoscopy Videos

Add code
Apr 23, 2026
Viaarxiv icon

Beyond Language: Grounding Referring Expressions with Hand Pointing in Egocentric Vision

Add code
Mar 27, 2026
Viaarxiv icon

Ran Score: a LLM-based Evaluation Score for Radiology Report Generation

Add code
Mar 24, 2026
Viaarxiv icon

How to Utilize Complementary Vision-Text Information for 2D Structure Understanding

Add code
Mar 17, 2026
Viaarxiv icon

EyeWorld: A Generative World Model of Ocular State and Dynamics

Add code
Mar 14, 2026
Viaarxiv icon

Enhancing Cross-View UAV Geolocalization via LVLM-Driven Relational Modeling

Add code
Mar 09, 2026
Viaarxiv icon

QCAgent: An agentic framework for quality-controllable pathology report generation from whole slide image

Add code
Mar 02, 2026
Viaarxiv icon

AlgBench: To What Extent Do Large Reasoning Models Understand Algorithms?

Add code
Jan 08, 2026
Viaarxiv icon

Understanding What Is Not Said:Referring Remote Sensing Image Segmentation with Scarce Expressions

Add code
Oct 26, 2025
Viaarxiv icon