Lianwen Jin

MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories

Jun 05, 2025
Via arXiv

OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning

May 22, 2025
Via arXiv

Visual Text Processing: A Comprehensive Review and Unified Evaluation

Apr 30, 2025
Via arXiv

Privacy-Preserving Biometric Verification with Handwritten Random Digit String

Mar 17, 2025
Via arXiv

Online Signature Verification based on the Lagrange formulation with 2D and 3D robotic models

Mar 17, 2025
Via arXiv

Smaller But Better: Unifying Layout Generation with Smaller Large Language Models

Feb 19, 2025
Via arXiv

Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs

Jan 31, 2025
Via arXiv

OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning

Dec 31, 2024
Via arXiv

Explainable Tampered Text Detection via Multimodal Large Models

Dec 19, 2024
Via arXiv

Predicting the Original Appearance of Damaged Historical Documents

Dec 16, 2024
Via arXiv