Picture for Zhijian Zhou

Zhijian Zhou

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

Add code
Sep 09, 2025
Figure 1 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 2 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 3 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 4 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Viaarxiv icon

SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training

Add code
May 28, 2025
Figure 1 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Figure 2 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Figure 3 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Figure 4 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Viaarxiv icon

ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools

Add code
May 27, 2025
Figure 1 for ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools
Figure 2 for ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools
Figure 3 for ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools
Figure 4 for ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools
Viaarxiv icon

Revisit Non-parametric Two-sample Testing as a Semi-supervised Learning Problem

Add code
Nov 30, 2024
Figure 1 for Revisit Non-parametric Two-sample Testing as a Semi-supervised Learning Problem
Figure 2 for Revisit Non-parametric Two-sample Testing as a Semi-supervised Learning Problem
Figure 3 for Revisit Non-parametric Two-sample Testing as a Semi-supervised Learning Problem
Figure 4 for Revisit Non-parametric Two-sample Testing as a Semi-supervised Learning Problem
Viaarxiv icon