Picture for Jingren Zhou

Jingren Zhou

additional authors not shown

One-Way Policy Optimization for Self-Evolving LLMs

Add code
May 21, 2026
Viaarxiv icon

Clipping Bottleneck: Stabilizing RLVR via Stochastic Recovery of Near-Boundary Signals

Add code
May 21, 2026
Viaarxiv icon

Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models

Add code
May 12, 2026
Viaarxiv icon

Qwen-Image-2.0 Technical Report

Add code
May 11, 2026
Viaarxiv icon

Wan-Image: Pushing the Boundaries of Generative Visual Intelligence

Add code
Apr 21, 2026
Viaarxiv icon

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Add code
Mar 29, 2026
Viaarxiv icon

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Add code
Mar 23, 2026
Viaarxiv icon

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Add code
Mar 23, 2026
Viaarxiv icon

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Add code
Mar 20, 2026
Viaarxiv icon

Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents

Add code
Feb 15, 2026
Viaarxiv icon