Picture for Conghui He

Conghui He

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Add code
Mar 23, 2026
Viaarxiv icon

Molecular Identifier Visual Prompt and Verifiable Reinforcement Learning for Chemical Reaction Diagram Parsing

Add code
Mar 17, 2026
Viaarxiv icon

AgenticOCR: Parsing Only What You Need for Efficient Retrieval-Augmented Generation

Add code
Feb 27, 2026
Viaarxiv icon

PointCoT: A Multi-modal Benchmark for Explicit 3D Geometric Reasoning

Add code
Feb 27, 2026
Viaarxiv icon

MoDora: Tree-Based Semi-Structured Document Analysis System

Add code
Feb 26, 2026
Viaarxiv icon

The Trinity of Consistency as a Defining Principle for General World Models

Add code
Feb 26, 2026
Viaarxiv icon

NMRTrans: Structure Elucidation from Experimental NMR Spectra via Set Transformers

Add code
Feb 10, 2026
Viaarxiv icon

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Add code
Feb 09, 2026
Viaarxiv icon

Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification

Add code
Jan 30, 2026
Viaarxiv icon

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

Add code
Jan 29, 2026
Viaarxiv icon