Picture for Ming Hu

Ming Hu

MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs

Add code
Oct 02, 2025
Viaarxiv icon

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Add code
Aug 28, 2025
Viaarxiv icon

ClaimGen-CN: A Large-scale Chinese Dataset for Legal Claim Generation

Add code
Aug 24, 2025
Viaarxiv icon

S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything without Supervision

Add code
Aug 09, 2025
Viaarxiv icon

TAGS: A Test-Time Generalist-Specialist Framework with Retrieval-Augmented Reasoning and Verification

Add code
May 23, 2025
Viaarxiv icon

Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery

Add code
May 23, 2025
Viaarxiv icon

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding

Add code
May 22, 2025
Viaarxiv icon

RetinaLogos: Fine-Grained Synthesis of High-Resolution Retinal Images Through Captions

Add code
May 19, 2025
Viaarxiv icon

MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment

Add code
May 14, 2025
Figure 1 for MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment
Figure 2 for MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment
Figure 3 for MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment
Figure 4 for MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment
Viaarxiv icon

Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model

Add code
May 13, 2025
Viaarxiv icon