Picture for Ming Hu

Ming Hu

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Add code
Dec 18, 2025
Viaarxiv icon

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Add code
Dec 18, 2025
Viaarxiv icon

Shedding Light on VLN Robustness: A Black-box Framework for Indoor Lighting-based Adversarial Attack

Add code
Nov 17, 2025
Figure 1 for Shedding Light on VLN Robustness: A Black-box Framework for Indoor Lighting-based Adversarial Attack
Figure 2 for Shedding Light on VLN Robustness: A Black-box Framework for Indoor Lighting-based Adversarial Attack
Figure 3 for Shedding Light on VLN Robustness: A Black-box Framework for Indoor Lighting-based Adversarial Attack
Figure 4 for Shedding Light on VLN Robustness: A Black-box Framework for Indoor Lighting-based Adversarial Attack
Viaarxiv icon

MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs

Add code
Oct 02, 2025
Figure 1 for MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
Figure 2 for MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
Figure 3 for MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
Figure 4 for MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
Viaarxiv icon

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Add code
Aug 28, 2025
Figure 1 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 2 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 3 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 4 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Viaarxiv icon

ClaimGen-CN: A Large-scale Chinese Dataset for Legal Claim Generation

Add code
Aug 24, 2025
Viaarxiv icon

S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything without Supervision

Add code
Aug 09, 2025
Viaarxiv icon

TAGS: A Test-Time Generalist-Specialist Framework with Retrieval-Augmented Reasoning and Verification

Add code
May 23, 2025
Figure 1 for TAGS: A Test-Time Generalist-Specialist Framework with Retrieval-Augmented Reasoning and Verification
Figure 2 for TAGS: A Test-Time Generalist-Specialist Framework with Retrieval-Augmented Reasoning and Verification
Figure 3 for TAGS: A Test-Time Generalist-Specialist Framework with Retrieval-Augmented Reasoning and Verification
Figure 4 for TAGS: A Test-Time Generalist-Specialist Framework with Retrieval-Augmented Reasoning and Verification
Viaarxiv icon

Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery

Add code
May 23, 2025
Figure 1 for Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery
Figure 2 for Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery
Figure 3 for Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery
Figure 4 for Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery
Viaarxiv icon

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding

Add code
May 22, 2025
Viaarxiv icon