Picture for Lianwen Jin

Lianwen Jin

Online Signature Verification based on the Lagrange formulation with 2D and 3D robotic models

Add code
Mar 17, 2025
Figure 1 for Online Signature Verification based on the Lagrange formulation with 2D and 3D robotic models
Figure 2 for Online Signature Verification based on the Lagrange formulation with 2D and 3D robotic models
Figure 3 for Online Signature Verification based on the Lagrange formulation with 2D and 3D robotic models
Viaarxiv icon

Privacy-Preserving Biometric Verification with Handwritten Random Digit String

Add code
Mar 17, 2025
Viaarxiv icon

Smaller But Better: Unifying Layout Generation with Smaller Large Language Models

Add code
Feb 19, 2025
Viaarxiv icon

Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs

Add code
Jan 31, 2025
Figure 1 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Figure 2 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Figure 3 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Figure 4 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Viaarxiv icon

OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning

Add code
Dec 31, 2024
Figure 1 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Figure 2 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Figure 3 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Figure 4 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Viaarxiv icon

Explainable Tampered Text Detection via Multimodal Large Models

Add code
Dec 19, 2024
Figure 1 for Explainable Tampered Text Detection via Multimodal Large Models
Figure 2 for Explainable Tampered Text Detection via Multimodal Large Models
Figure 3 for Explainable Tampered Text Detection via Multimodal Large Models
Figure 4 for Explainable Tampered Text Detection via Multimodal Large Models
Viaarxiv icon

Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach

Add code
Dec 16, 2024
Figure 1 for Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach
Figure 2 for Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach
Figure 3 for Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach
Figure 4 for Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach
Viaarxiv icon

Predicting the Original Appearance of Damaged Historical Documents

Add code
Dec 16, 2024
Figure 1 for Predicting the Original Appearance of Damaged Historical Documents
Figure 2 for Predicting the Original Appearance of Damaged Historical Documents
Figure 3 for Predicting the Original Appearance of Damaged Historical Documents
Figure 4 for Predicting the Original Appearance of Damaged Historical Documents
Viaarxiv icon

Omni-IML: Towards Unified Image Manipulation Localization

Add code
Nov 22, 2024
Figure 1 for Omni-IML: Towards Unified Image Manipulation Localization
Figure 2 for Omni-IML: Towards Unified Image Manipulation Localization
Figure 3 for Omni-IML: Towards Unified Image Manipulation Localization
Figure 4 for Omni-IML: Towards Unified Image Manipulation Localization
Viaarxiv icon

VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models

Add code
Oct 01, 2024
Viaarxiv icon