Picture for Lijun Wu

Lijun Wu

SpatiaLoc: Leveraging Multi-Level Spatial Enhanced Descriptors for Cross-Modal Localization

Add code
Jan 07, 2026
Viaarxiv icon

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

Add code
Dec 16, 2025
Figure 1 for OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
Figure 2 for OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
Figure 3 for OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
Figure 4 for OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
Viaarxiv icon

GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models

Add code
Nov 14, 2025
Figure 1 for GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
Figure 2 for GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
Figure 3 for GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
Figure 4 for GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
Viaarxiv icon

Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs

Add code
Oct 27, 2025
Viaarxiv icon

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Add code
Sep 26, 2025
Viaarxiv icon

Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning

Add code
Aug 29, 2025
Figure 1 for Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Figure 2 for Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Figure 3 for Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Figure 4 for Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Viaarxiv icon

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Add code
Aug 28, 2025
Figure 1 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 2 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 3 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 4 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Viaarxiv icon

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Add code
Aug 25, 2025
Figure 1 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 2 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 3 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 4 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Viaarxiv icon

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Add code
Jul 23, 2025
Viaarxiv icon

GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition

Add code
Jun 09, 2025
Figure 1 for GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
Figure 2 for GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
Figure 3 for GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
Figure 4 for GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
Viaarxiv icon