Picture for Lijun Wu

Lijun Wu

Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning

Add code
Aug 29, 2025
Viaarxiv icon

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Add code
Aug 28, 2025
Viaarxiv icon

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Add code
Aug 25, 2025
Viaarxiv icon

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Add code
Jul 23, 2025
Viaarxiv icon

GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition

Add code
Jun 09, 2025
Viaarxiv icon

Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering

Add code
May 22, 2025
Viaarxiv icon

IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment

Add code
May 19, 2025
Viaarxiv icon

Efficient Reasoning for LLMs through Speculative Chain-of-Thought

Add code
Apr 27, 2025
Viaarxiv icon

CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges

Add code
Apr 27, 2025
Viaarxiv icon

DONOD: Robust and Generalizable Instruction Fine-Tuning for LLMs via Model-Intrinsic Dataset Pruning

Add code
Apr 21, 2025
Viaarxiv icon