Picture for Cheng Cui

Cheng Cui

Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

Add code
Mar 25, 2026
Viaarxiv icon

PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks

Add code
Mar 25, 2026
Viaarxiv icon

Real5-OmniDocBench: A Full-Scale Physical Reconstruction Benchmark for Robust Document Parsing in the Wild

Add code
Mar 04, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

Add code
Jan 29, 2026
Viaarxiv icon

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Add code
Oct 16, 2025
Viaarxiv icon

PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction

Add code
Mar 21, 2025
Figure 1 for PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction
Figure 2 for PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction
Figure 3 for PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction
Figure 4 for PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction
Viaarxiv icon

WebCanvas: Benchmarking Web Agents in Online Environments

Add code
Jun 18, 2024
Viaarxiv icon

DETRs Beat YOLOs on Real-time Object Detection

Add code
Apr 17, 2023
Figure 1 for DETRs Beat YOLOs on Real-time Object Detection
Figure 2 for DETRs Beat YOLOs on Real-time Object Detection
Figure 3 for DETRs Beat YOLOs on Real-time Object Detection
Figure 4 for DETRs Beat YOLOs on Real-time Object Detection
Viaarxiv icon

GLAD: Grounded Layered Autonomous Driving for Complex Service Tasks

Add code
Oct 05, 2022
Figure 1 for GLAD: Grounded Layered Autonomous Driving for Complex Service Tasks
Figure 2 for GLAD: Grounded Layered Autonomous Driving for Complex Service Tasks
Figure 3 for GLAD: Grounded Layered Autonomous Driving for Complex Service Tasks
Figure 4 for GLAD: Grounded Layered Autonomous Driving for Complex Service Tasks
Viaarxiv icon