Picture for Yonghui Wang

Yonghui Wang

TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding

Add code
Apr 15, 2024
Figure 1 for TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding
Figure 2 for TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding
Figure 3 for TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding
Figure 4 for TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding
Viaarxiv icon

Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs

Add code
Nov 22, 2023
Viaarxiv icon

Progressive Recurrent Network for Shadow Removal

Add code
Nov 01, 2023
Figure 1 for Progressive Recurrent Network for Shadow Removal
Figure 2 for Progressive Recurrent Network for Shadow Removal
Figure 3 for Progressive Recurrent Network for Shadow Removal
Figure 4 for Progressive Recurrent Network for Shadow Removal
Viaarxiv icon

Detect Any Shadow: Segment Anything for Video Shadow Detection

Add code
May 26, 2023
Figure 1 for Detect Any Shadow: Segment Anything for Video Shadow Detection
Figure 2 for Detect Any Shadow: Segment Anything for Video Shadow Detection
Figure 3 for Detect Any Shadow: Segment Anything for Video Shadow Detection
Figure 4 for Detect Any Shadow: Segment Anything for Video Shadow Detection
Viaarxiv icon

UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior

Add code
Oct 15, 2022
Figure 1 for UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior
Figure 2 for UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior
Figure 3 for UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior
Figure 4 for UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior
Viaarxiv icon