Conghui He

FakeVLM-R1: Internalizing Physical Laws via CoT for Synthetic Image Detection

May 28, 2026
Via arXiv

MinerU-Popo: Universal Post-Processing Model for Structured Document Parsing

May 24, 2026
Via arXiv

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

May 13, 2026
Via arXiv

Respecting Self-Uncertainty in On-Policy Self-Distillation for Efficient LLM Reasoning

May 13, 2026
Via arXiv

PaperFit: Vision-in-the-Loop Typesetting Optimization for Scientific Documents

May 11, 2026
Via arXiv

NanoResearch: Co-Evolving Skills, Memory, and Policy for Personalized Research Automation

May 11, 2026
Via arXiv

MolRecBench-Wild: A Real-World Benchmark for Optical Chemical Structure Recognition

May 07, 2026
Via arXiv

Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora

Apr 27, 2026
Via arXiv

Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs

Apr 12, 2026
Via arXiv

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Apr 06, 2026
Via arXiv