Picture for Zening Lin

Zening Lin

URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding

Add code
Nov 13, 2025
Figure 1 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Figure 2 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Figure 3 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Figure 4 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Viaarxiv icon

PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction

Add code
Jan 07, 2024
Figure 1 for PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
Figure 2 for PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
Figure 3 for PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
Figure 4 for PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
Viaarxiv icon

Exploring OCR Capabilities of GPT-4V : A Quantitative and In-depth Evaluation

Add code
Oct 29, 2023
Figure 1 for Exploring OCR Capabilities of GPT-4V : A Quantitative and In-depth Evaluation
Figure 2 for Exploring OCR Capabilities of GPT-4V : A Quantitative and In-depth Evaluation
Figure 3 for Exploring OCR Capabilities of GPT-4V : A Quantitative and In-depth Evaluation
Figure 4 for Exploring OCR Capabilities of GPT-4V : A Quantitative and In-depth Evaluation
Viaarxiv icon