Picture for Ming Hu

Ming Hu

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

Add code
May 25, 2026
Viaarxiv icon

DermAgent: A Self-Reflective Agentic System for Dermatological Image Analysis with Multi-Tool Reasoning and Traceable Decision-Making

Add code
May 14, 2026
Viaarxiv icon

Leveraging Multimodal LLMs for Built Environment and Housing Attribute Assessment from Street-View Imagery

Add code
Apr 22, 2026
Viaarxiv icon

MedProbeBench: Systematic Benchmarking at Deep Evidence Integration for Expert-level Medical Guideline

Add code
Apr 20, 2026
Viaarxiv icon

Radiology Report Generation for Low-Quality X-Ray Images

Add code
Apr 11, 2026
Viaarxiv icon

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Add code
Mar 29, 2026
Viaarxiv icon

FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement

Add code
Mar 20, 2026
Viaarxiv icon

Foundation-Model Surrogates Enable Data-Efficient Active Learning for Materials Discovery

Add code
Mar 17, 2026
Viaarxiv icon

Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding

Add code
Mar 09, 2026
Viaarxiv icon

OPGAgent: An Agent for Auditable Dental Panoramic X-ray Interpretation

Add code
Feb 28, 2026
Viaarxiv icon