Picture for Linke Ouyang

Linke Ouyang

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Add code
Apr 06, 2026
Viaarxiv icon

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Add code
Mar 26, 2026
Viaarxiv icon

DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM

Add code
Dec 11, 2025
Figure 1 for DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM
Figure 2 for DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM
Figure 3 for DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM
Figure 4 for DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM
Viaarxiv icon

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Add code
Sep 26, 2025
Viaarxiv icon

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Add code
Dec 10, 2024
Figure 1 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Figure 2 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Figure 3 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Figure 4 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Viaarxiv icon

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Add code
Dec 03, 2024
Viaarxiv icon

MinerU: An Open-Source Solution for Precise Document Content Extraction

Add code
Sep 27, 2024
Figure 1 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 2 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 3 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 4 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Viaarxiv icon

CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation

Add code
Sep 05, 2024
Figure 1 for CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Figure 2 for CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Figure 3 for CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Figure 4 for CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Viaarxiv icon

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Add code
Jul 03, 2024
Figure 1 for InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Figure 2 for InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Figure 3 for InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Figure 4 for InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Viaarxiv icon

DSDL: Data Set Description Language for Bridging Modalities and Tasks in AI Data

Add code
May 28, 2024
Viaarxiv icon