Wayne Xin Zhao

Controlled LLM Training on Spectral Sphere

Jan 13, 2026

VIPER: Process-aware Evaluation for Generative Video Reasoning

Dec 31, 2025

Entropy-Guided Token Dropout: Training Autoregressive Language Models with Limited Domain Data

Dec 29, 2025

Rethinking Sample Polarity in Reinforcement Learning with Verifiable Rewards

Dec 25, 2025

Scaling Laws for Code: Every Programming Language Matters

Dec 15, 2025

Spatio-Temporal Data Enhanced Vision-Language Model for Traffic Scene Understanding

Nov 12, 2025

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Nov 10, 2025

MARS: Optimizing Dual-System Deep Research via Multi-Agent Reinforcement Learning

Oct 06, 2025

Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework

Sep 05, 2025

STARec: An Efficient Agent Framework for Recommender Systems via Autonomous Deliberate Reasoning

Aug 26, 2025