
Peng Pei

SpanNorm: Reconciling Training Stability and Performance in Deep Transformers

Jan 30, 2026
Via arXiv

Exploring Reasoning Reward Model for Agents

Jan 29, 2026
Via arXiv

Scaling Embeddings Outperforms Scaling Experts in Language Models

Jan 29, 2026
Via arXiv

LongCat-Flash-Thinking-2601 Technical Report

Jan 23, 2026
Via arXiv

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Jan 23, 2026
Via arXiv

GIFT: Unlocking Global Optimality in Post-Training via Finite-Temperature Gibbs Initialization

Jan 14, 2026
Via arXiv

OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation

Dec 10, 2025
Via arXiv

MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models

Jun 17, 2025
Via arXiv

GradPower: Powering Gradients for Faster Language Model Pre-Training

May 30, 2025
Via arXiv