Jun Zhao

WideSeek: Advancing Wide Research via Multi-Agent Scaling

Feb 02, 2026
Via arXiv

S$^2$GR: Stepwise Semantic-Guided Reasoning in Latent Space for Generative Recommendation

Jan 26, 2026
Via arXiv

Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment

Jan 20, 2026
Via arXiv

Enhancing Image Quality Assessment Ability of LMMs via Retrieval-Augmented Generation

Jan 13, 2026
Via arXiv

Learning How to Remember: A Meta-Cognitive Management Method for Structured and Transferable Agent Memory

Jan 12, 2026
Via arXiv

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Dec 22, 2025
Via arXiv

Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning

Nov 13, 2025
Via arXiv

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Nov 06, 2025
Via arXiv

InfoFlow: Reinforcing Search Agent Via Reward Density Optimization

Oct 30, 2025
Via arXiv

The Zero-Step Thinking: An Empirical Study of Mode Selection as Harder Early Exit in Reasoning Models

Oct 22, 2025
Via arXiv