Junbo Zhao

Jake

Momentum for Reasoning: Dense Intrinsic Signals in Policy Optimization

Jun 07, 2026
Via arXiv

SkillComposer: Learning to Evolve Agent Skills for Specification and Generalization

Jun 04, 2026
Via arXiv

From Parameters to Data: A Task-Parameter-Guided Fine-Tuning Pipeline for Efficient LLM Alignment

May 20, 2026
Via arXiv

Can LLMs Act as Historians? Evaluating Historical Research Capabilities of LLMs via the Chinese Imperial Examination

Apr 27, 2026
Via arXiv

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Apr 22, 2026
Via arXiv

DeltaMem: Towards Agentic Memory Management via Reinforcement Learning

Apr 02, 2026
Via arXiv

Optimsyn: Influence-Guided Rubrics Optimization for Synthetic Data Generation

Apr 01, 2026
Via arXiv

FEAT: A Linear-Complexity Foundation Model for Extremely Large Structured Data

Mar 17, 2026
Via arXiv

On Multi-Step Theorem Prediction via Non-Parametric Structural Priors

Mar 05, 2026
Via arXiv

KMLP: A Scalable Hybrid Architecture for Web-Scale Tabular Data Modeling

Feb 26, 2026
Via arXiv