Zhuokai Zhao

Agentic Recommender System with Hierarchical Belief-State Memory

May 14, 2026
Via arXiv

Synthetic Sandbox for Training Machine Learning Engineering Agents

Apr 06, 2026
Via arXiv

LLM-Driven Reasoning for Constraint-Aware Feature Selection in Industrial Systems

Mar 26, 2026
Via arXiv

TARo: Token-level Adaptive Routing for LLM Test-time Alignment

Mar 19, 2026
Via arXiv

Accelerating PDE Surrogates via RL-Guided Mesh Optimization

Mar 02, 2026
Via arXiv

Token-Level LLM Collaboration via FusionRoute

Jan 08, 2026
Via arXiv

Scaling Agent Learning via Experience Synthesis

Nov 10, 2025
Via arXiv

Thought Communication in Multiagent Collaboration

Oct 23, 2025
Via arXiv

Exploring System 1 and 2 communication for latent reasoning in LLMs

Oct 01, 2025
Via arXiv

Boosting LLM Reasoning via Spontaneous Self-Correction

Jun 07, 2025
Via arXiv