Yongxin Shi

MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories

Jun 05, 2025
Via arXiv

OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning

May 22, 2025
Via arXiv

Predicting the Original Appearance of Damaged Historical Documents

Dec 16, 2024
Via arXiv

TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models

Jul 04, 2024
Via arXiv

C$^{3}$Bench: A Comprehensive Classical Chinese Understanding Benchmark for Large Language Models

May 28, 2024
Via arXiv

UPOCR: Towards Unified Pixel-Level OCR Interface

Dec 05, 2023
Via arXiv

Exploring OCR Capabilities of GPT-4V: A Quantitative and In-depth Evaluation

Oct 29, 2023
Via arXiv