Picture for Bo Zheng

Bo Zheng

additional authors not shown

A Unified View of Attention and Residual Sinks: Outlier-Driven Rescaling is Essential for Transformer Training

Add code
Jan 30, 2026
Viaarxiv icon

Modeling Cascaded Delay Feedback for Online Net Conversion Rate Prediction: Benchmark, Insights and Solutions

Add code
Jan 29, 2026
Viaarxiv icon

CE-RM: A Pointwise Generative Reward Model Optimized via Two-Stage Rollout and Unified Criteria

Add code
Jan 28, 2026
Viaarxiv icon

ShopSimulator: Evaluating and Exploring RL-Driven LLM Agent for Shopping Assistants

Add code
Jan 26, 2026
Viaarxiv icon

Multi-Behavior Sequential Modeling with Transition-Aware Graph Attention Network for E-Commerce Recommendation

Add code
Jan 21, 2026
Viaarxiv icon

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Add code
Jan 21, 2026
Viaarxiv icon

DecisionLLM: Large Language Models for Long Sequence Decision Exploration

Add code
Jan 15, 2026
Viaarxiv icon

Parallel Latent Reasoning for Sequential Recommendation

Add code
Jan 06, 2026
Viaarxiv icon

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

Add code
Jan 06, 2026
Viaarxiv icon

Unified Thinker: A General Reasoning Modular Core for Image Generation

Add code
Jan 06, 2026
Viaarxiv icon