Picture for Lijun Wu

Lijun Wu

Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs

Add code
Apr 12, 2026
Viaarxiv icon

Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning

Add code
Apr 07, 2026
Viaarxiv icon

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Add code
Apr 06, 2026
Viaarxiv icon

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Add code
Mar 26, 2026
Viaarxiv icon

Molecular Identifier Visual Prompt and Verifiable Reinforcement Learning for Chemical Reaction Diagram Parsing

Add code
Mar 17, 2026
Viaarxiv icon

Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning

Add code
Mar 05, 2026
Viaarxiv icon

Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification

Add code
Jan 30, 2026
Viaarxiv icon

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

Add code
Jan 29, 2026
Viaarxiv icon

Parameter Inference and Uncertainty Quantification with Diffusion Models: Extending CDI to 2D Spatial Conditioning

Add code
Jan 23, 2026
Viaarxiv icon

ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch

Add code
Jan 20, 2026
Viaarxiv icon