Jingren Zhou

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Mar 23, 2026
Via arXiv

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Mar 23, 2026
Via arXiv

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Mar 20, 2026
Via arXiv

Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents

Feb 15, 2026
Via arXiv

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Feb 02, 2026
Via arXiv

A Unified View of Attention and Residual Sinks: Outlier-Driven Rescaling is Essential for Transformer Training

Jan 30, 2026
Via arXiv

Qwen3-ASR Technical Report

Jan 29, 2026
Via arXiv

Qwen3-TTS Technical Report

Jan 22, 2026
Via arXiv

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Jan 22, 2026
Via arXiv

Evidence-Augmented Policy Optimization with Reward Co-Evolution for Long-Context Reasoning

Jan 15, 2026
Via arXiv