Title: DocTron-Formula: Generalized Formula Recognition in Complex and Structured Scenarios

Aug 01, 2025

Yufeng Zhong, Zhixiong Zeng, Lei Chen, Longrong Yang, Liming Zheng, Jing Huang, Siqi Yang, Lin Ma

Abstract: Optical Character Recognition (OCR) for mathematical formulas is essential for the intelligent analysis of scientific literature. However, both task-specific and general vision-language models often struggle to handle the structural diversity, complexity, and real-world variability inherent in mathematical content. In this work, we present DocTron-Formula, a unified framework built upon general vision-language models, thereby eliminating the need for specialized architectures. Furthermore, we introduce CSFormula, a large-scale and challenging dataset that encompasses multidisciplinary and structurally complex formulas at the line, paragraph, and page levels. Through straightforward supervised fine-tuning, our approach achieves state-of-the-art performance across a variety of styles, scientific domains, and complex layouts. Experimental results demonstrate that our method not only surpasses specialized models in terms of accuracy and robustness, but also establishes a new paradigm for the automated understanding of complex scientific documents.
