Picture for Xiaomin Wu

Xiaomin Wu

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Add code
Mar 10, 2026
Viaarxiv icon

PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology

Add code
Aug 13, 2024
Figure 1 for PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology
Figure 2 for PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology
Figure 3 for PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology
Figure 4 for PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology
Viaarxiv icon

What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning

Add code
Oct 31, 2023
Figure 1 for What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning
Figure 2 for What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning
Figure 3 for What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning
Figure 4 for What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning
Viaarxiv icon

HTEC: Human Transcription Error Correction

Add code
Sep 18, 2023
Figure 1 for HTEC: Human Transcription Error Correction
Figure 2 for HTEC: Human Transcription Error Correction
Figure 3 for HTEC: Human Transcription Error Correction
Figure 4 for HTEC: Human Transcription Error Correction
Viaarxiv icon