Picture for Yang Zhou

Yang Zhou

Yahoo! Labs

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Add code
Aug 23, 2025
Viaarxiv icon

Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS

Add code
Aug 19, 2025
Viaarxiv icon

GhostShell: Streaming LLM Function Calls for Concurrent Embodied Programming

Add code
Aug 07, 2025
Viaarxiv icon

Demystify Protein Generation with Hierarchical Conditional Diffusion Models

Add code
Jul 24, 2025
Viaarxiv icon

StreamME: Simplify 3D Gaussian Avatar within Live Stream

Add code
Jul 22, 2025
Viaarxiv icon

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning

Add code
Jul 17, 2025
Viaarxiv icon

UVLM: Benchmarking Video Language Model for Underwater World Understanding

Add code
Jul 03, 2025
Viaarxiv icon

Kwai Keye-VL Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

Robust OOD Graph Learning via Mean Constraints and Noise Reduction

Add code
Jun 24, 2025
Viaarxiv icon

A Batch-Insensitive Dynamic GNN Approach to Address Temporal Discontinuity in Graph Streams

Add code
Jun 24, 2025
Viaarxiv icon