Picture for Zhuokai Zhao

Zhuokai Zhao

Scaling Agent Learning via Experience Synthesis

Add code
Nov 10, 2025
Viaarxiv icon

Thought Communication in Multiagent Collaboration

Add code
Oct 23, 2025
Viaarxiv icon

Exploring System 1 and 2 communication for latent reasoning in LLMs

Add code
Oct 01, 2025
Viaarxiv icon

Boosting LLM Reasoning via Spontaneous Self-Correction

Add code
Jun 07, 2025
Figure 1 for Boosting LLM Reasoning via Spontaneous Self-Correction
Figure 2 for Boosting LLM Reasoning via Spontaneous Self-Correction
Figure 3 for Boosting LLM Reasoning via Spontaneous Self-Correction
Figure 4 for Boosting LLM Reasoning via Spontaneous Self-Correction
Viaarxiv icon

DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data

Add code
May 21, 2025
Viaarxiv icon

Transfer between Modalities with MetaQueries

Add code
Apr 08, 2025
Figure 1 for Transfer between Modalities with MetaQueries
Figure 2 for Transfer between Modalities with MetaQueries
Figure 3 for Transfer between Modalities with MetaQueries
Figure 4 for Transfer between Modalities with MetaQueries
Viaarxiv icon

S'MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning

Add code
Apr 08, 2025
Viaarxiv icon

CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning

Add code
Mar 25, 2025
Viaarxiv icon

HumanMM: Global Human Motion Recovery from Multi-shot Videos

Add code
Mar 10, 2025
Viaarxiv icon

Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment

Add code
Jan 16, 2025
Figure 1 for Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
Figure 2 for Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
Figure 3 for Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
Figure 4 for Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
Viaarxiv icon