Picture for Xianwei Mao

Xianwei Mao

Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning

Add code
May 24, 2025
Viaarxiv icon