Jun Zhao

Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment

Jan 20, 2026
Via arXiv

Enhancing Image Quality Assessment Ability of LMMs via Retrieval-Augmented Generation

Jan 13, 2026
Via arXiv

Learning How to Remember: A Meta-Cognitive Management Method for Structured and Transferable Agent Memory

Jan 12, 2026
Via arXiv

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Dec 22, 2025
Via arXiv

Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning

Nov 13, 2025
Via arXiv

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Nov 06, 2025
Via arXiv

InfoFlow: Reinforcing Search Agent Via Reward Density Optimization

Oct 30, 2025
Via arXiv

The Zero-Step Thinking: An Empirical Study of Mode Selection as Harder Early Exit in Reasoning Models

Oct 22, 2025
Via arXiv

Towards Agentic Self-Learning LLMs in Search Environment

Oct 16, 2025
Via arXiv

MotivGraph-SoIQ: Integrating Motivational Knowledge Graphs and Socratic Dialogue for Enhanced LLM Ideation

Sep 26, 2025
Via arXiv