Picture for Lianwen Jin

Lianwen Jin

TextShield-R1: Reinforced Reasoning for Tampered Text Detection

Add code
Feb 23, 2026
Viaarxiv icon

Training-Free Acceleration for Document Parsing Vision-Language Model with Hierarchical Speculative Decoding

Add code
Feb 13, 2026
Viaarxiv icon

PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography

Add code
Jan 07, 2026
Viaarxiv icon

ContextDrag: Precise Drag-Based Image Editing via Context-Preserving Token Injection and Position-Consistent Attention

Add code
Dec 09, 2025
Viaarxiv icon

URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding

Add code
Nov 13, 2025
Figure 1 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Figure 2 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Figure 3 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Figure 4 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Viaarxiv icon

Webly-Supervised Image Manipulation Localization via Category-Aware Auto-Annotation

Add code
Aug 28, 2025
Figure 1 for Webly-Supervised Image Manipulation Localization via Category-Aware Auto-Annotation
Figure 2 for Webly-Supervised Image Manipulation Localization via Category-Aware Auto-Annotation
Figure 3 for Webly-Supervised Image Manipulation Localization via Category-Aware Auto-Annotation
Figure 4 for Webly-Supervised Image Manipulation Localization via Category-Aware Auto-Annotation
Viaarxiv icon

MCCD: A Multi-Attribute Chinese Calligraphy Character Dataset Annotated with Script Styles, Dynasties, and Calligraphers

Add code
Jul 09, 2025
Figure 1 for MCCD: A Multi-Attribute Chinese Calligraphy Character Dataset Annotated with Script Styles, Dynasties, and Calligraphers
Figure 2 for MCCD: A Multi-Attribute Chinese Calligraphy Character Dataset Annotated with Script Styles, Dynasties, and Calligraphers
Figure 3 for MCCD: A Multi-Attribute Chinese Calligraphy Character Dataset Annotated with Script Styles, Dynasties, and Calligraphers
Figure 4 for MCCD: A Multi-Attribute Chinese Calligraphy Character Dataset Annotated with Script Styles, Dynasties, and Calligraphers
Viaarxiv icon

MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories

Add code
Jun 05, 2025
Figure 1 for MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories
Figure 2 for MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories
Figure 3 for MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories
Figure 4 for MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories
Viaarxiv icon

OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning

Add code
May 22, 2025
Viaarxiv icon

Visual Text Processing: A Comprehensive Review and Unified Evaluation

Add code
Apr 30, 2025
Figure 1 for Visual Text Processing: A Comprehensive Review and Unified Evaluation
Figure 2 for Visual Text Processing: A Comprehensive Review and Unified Evaluation
Figure 3 for Visual Text Processing: A Comprehensive Review and Unified Evaluation
Figure 4 for Visual Text Processing: A Comprehensive Review and Unified Evaluation
Viaarxiv icon