Picture for Dezhi Peng

Dezhi Peng

Reinforcement Learning with Robust Rubric Rewards

Add code
May 28, 2026
Viaarxiv icon

Visual Preference Optimization with Rubric Rewards

Add code
Apr 14, 2026
Viaarxiv icon

URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding

Add code
Nov 13, 2025
Figure 1 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Figure 2 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Figure 3 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Figure 4 for URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Viaarxiv icon

OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning

Add code
May 22, 2025
Viaarxiv icon

Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs

Add code
Jan 31, 2025
Figure 1 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Figure 2 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Figure 3 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Figure 4 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Viaarxiv icon

Predicting the Original Appearance of Damaged Historical Documents

Add code
Dec 16, 2024
Figure 1 for Predicting the Original Appearance of Damaged Historical Documents
Figure 2 for Predicting the Original Appearance of Damaged Historical Documents
Figure 3 for Predicting the Original Appearance of Damaged Historical Documents
Figure 4 for Predicting the Original Appearance of Damaged Historical Documents
Viaarxiv icon

TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models

Add code
Jul 04, 2024
Viaarxiv icon

C$^{3}$Bench: A Comprehensive Classical Chinese Understanding Benchmark for Large Language Models

Add code
May 28, 2024
Viaarxiv icon

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Add code
May 07, 2024
Figure 1 for DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Figure 2 for DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Figure 3 for DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Figure 4 for DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Viaarxiv icon

HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

Add code
Mar 20, 2024
Figure 1 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Figure 2 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Figure 3 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Figure 4 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Viaarxiv icon