Ya Zhang

ConText: Driving In-context Learning for Text Removal and Segmentation

Jun 04, 2025
Via arXiv

SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding

May 22, 2025
Via arXiv

AutoMedEval: Harnessing Language Models for Automatic Medical Capability Evaluation

May 17, 2025
Via arXiv

Multi-Agent System for Comprehensive Soccer Understanding

May 06, 2025
Via arXiv

Multi-Scale Target-Aware Representation Learning for Fundus Image Enhancement

May 03, 2025
Via arXiv

Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection

Apr 29, 2025
Via arXiv

ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification

Apr 29, 2025
Via arXiv

Learning to Instruct for Visual Instruction Tuning

Mar 28, 2025
Via arXiv

ChatBEV: A Visual Language Model that Understands BEV Maps

Mar 21, 2025
Via arXiv

Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases

Mar 06, 2025
Via arXiv