Alert button

DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding

Nov 20, 2023
Hao Feng, Qi Liu, Hao Liu, Wengang Zhou, Houqiang Li, Can Huang

Figure 1 for DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding
Figure 2 for DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding
Figure 3 for DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding
Figure 4 for DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: