Yubo Zhang

PP-OCRv6: From 1.5M to 34.5M Parameters, Surpassing Billion-Scale VLMs on OCR Tasks

Jun 11, 2026
Via arXiv

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

Jun 02, 2026
Via arXiv

PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks

Mar 25, 2026
Via arXiv

Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

Mar 25, 2026
Via arXiv

ERNIE 5.0 Technical Report

Feb 04, 2026
Via arXiv

PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

Jan 29, 2026
Via arXiv

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Oct 16, 2025
Via arXiv

Neural Tangent Knowledge Distillation for Optical Convolutional Networks

Aug 11, 2025
Via arXiv

RAG+: Enhancing Retrieval-Augmented Generation with Application-Aware Reasoning

Jun 13, 2025
Via arXiv

Fair Dynamic Spectrum Access via Fully Decentralized Multi-Agent Reinforcement Learning

Mar 31, 2025
Via arXiv