Jennifer Yuntong Zhang

WS-GRPO: Weakly-Supervised Group-Relative Policy Optimization for Rollout-Efficient Reasoning

Feb 19, 2026
Via arXiv

SceneAlign: Aligning Multimodal Reasoning to Scene Graphs in Complex Visual Scenes

Jan 09, 2026
Via arXiv

Realistic CDSS Drug Dosing with End-to-end Recurrent Q-learning for Dual Vasopressor Control

Oct 01, 2025
Via arXiv

In-Context Algorithm Emulation in Fixed-Weight Transformers

Aug 24, 2025
Via arXiv