Picture for Dongsheng Ma

Dongsheng Ma

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Add code
Apr 06, 2026
Viaarxiv icon

AgenticOCR: Parsing Only What You Need for Efficient Retrieval-Augmented Generation

Add code
Feb 27, 2026
Viaarxiv icon

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Add code
Sep 26, 2025
Viaarxiv icon

CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation

Add code
Apr 30, 2025
Figure 1 for CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation
Figure 2 for CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation
Figure 3 for CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation
Figure 4 for CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation
Viaarxiv icon

RARE: Retrieval-Augmented Reasoning Modeling

Add code
Mar 30, 2025
Viaarxiv icon