Picture for Zongbo Han

Zongbo Han

Towards World Models in Biomedical Research

Add code
Jun 04, 2026
Viaarxiv icon

MULTIBENCH++: A Unified and Comprehensive Multimodal Fusion Benchmarking Across Specialized Domains

Add code
Nov 14, 2025
Viaarxiv icon

MedSG-Bench: A Benchmark for Medical Image Sequences Grounding

Add code
May 17, 2025
Viaarxiv icon

Turing Machine Evaluation for Large Language Model

Add code
Apr 29, 2025
Figure 1 for Turing Machine Evaluation for Large Language Model
Figure 2 for Turing Machine Evaluation for Large Language Model
Figure 3 for Turing Machine Evaluation for Large Language Model
Figure 4 for Turing Machine Evaluation for Large Language Model
Viaarxiv icon

DOTA: Distributional Test-Time Adaptation of Vision-Language Models

Add code
Sep 28, 2024
Figure 1 for DOTA: Distributional Test-Time Adaptation of Vision-Language Models
Figure 2 for DOTA: Distributional Test-Time Adaptation of Vision-Language Models
Figure 3 for DOTA: Distributional Test-Time Adaptation of Vision-Language Models
Figure 4 for DOTA: Distributional Test-Time Adaptation of Vision-Language Models
Viaarxiv icon

Confidence-aware multi-modality learning for eye disease screening

Add code
May 28, 2024
Figure 1 for Confidence-aware multi-modality learning for eye disease screening
Figure 2 for Confidence-aware multi-modality learning for eye disease screening
Figure 3 for Confidence-aware multi-modality learning for eye disease screening
Figure 4 for Confidence-aware multi-modality learning for eye disease screening
Viaarxiv icon

Hallucination of Multimodal Large Language Models: A Survey

Add code
Apr 29, 2024
Figure 1 for Hallucination of Multimodal Large Language Models: A Survey
Figure 2 for Hallucination of Multimodal Large Language Models: A Survey
Figure 3 for Hallucination of Multimodal Large Language Models: A Survey
Figure 4 for Hallucination of Multimodal Large Language Models: A Survey
Viaarxiv icon

Multimodal Fusion on Low-quality Data: A Comprehensive Survey

Add code
Apr 27, 2024
Figure 1 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Figure 2 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Figure 3 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Figure 4 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Viaarxiv icon

Selective Learning: Towards Robust Calibration with Dynamic Regularization

Add code
Feb 13, 2024
Figure 1 for Selective Learning: Towards Robust Calibration with Dynamic Regularization
Figure 2 for Selective Learning: Towards Robust Calibration with Dynamic Regularization
Figure 3 for Selective Learning: Towards Robust Calibration with Dynamic Regularization
Figure 4 for Selective Learning: Towards Robust Calibration with Dynamic Regularization
Viaarxiv icon

Skip : A Simple Method to Reduce Hallucination in Large Vision-Language Models

Add code
Feb 12, 2024
Viaarxiv icon