Picture for Boxing Chen

Boxing Chen

Huawei Noah's Ark Lab

TrasMuon: Trust-Region Adaptive Scaling for Orthogonalized Momentum Optimizers

Add code
Feb 13, 2026
Viaarxiv icon

Distributed Hybrid Parallelism for Large Language Models: Comparative Study and System Design Guide

Add code
Feb 09, 2026
Viaarxiv icon

EPAS: Efficient Training with Progressive Activation Sharing

Add code
Jan 27, 2026
Viaarxiv icon

Thinking Long, but Short: Stable Sequential Test-Time Scaling for Large Reasoning Models

Add code
Jan 14, 2026
Viaarxiv icon

A method for improving multilingual quality and diversity of instruction fine-tuning datasets

Add code
Sep 19, 2025
Viaarxiv icon

RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning

Add code
Sep 18, 2025
Viaarxiv icon

Nested-ReFT: Efficient Reinforcement Learning for Large Language Model Fine-Tuning via Off-Policy Rollouts

Add code
Aug 13, 2025
Viaarxiv icon

PoTPTQ: A Two-step Power-of-Two Post-training for LLMs

Add code
Jul 16, 2025
Figure 1 for PoTPTQ: A Two-step Power-of-Two Post-training for LLMs
Figure 2 for PoTPTQ: A Two-step Power-of-Two Post-training for LLMs
Figure 3 for PoTPTQ: A Two-step Power-of-Two Post-training for LLMs
Figure 4 for PoTPTQ: A Two-step Power-of-Two Post-training for LLMs
Viaarxiv icon

ECHO-LLaMA: Efficient Caching for High-Performance LLaMA Training

Add code
May 22, 2025
Viaarxiv icon

Resona: Improving Context Copying in Linear Recurrence Models with Retrieval

Add code
Mar 28, 2025
Figure 1 for Resona: Improving Context Copying in Linear Recurrence Models with Retrieval
Figure 2 for Resona: Improving Context Copying in Linear Recurrence Models with Retrieval
Figure 3 for Resona: Improving Context Copying in Linear Recurrence Models with Retrieval
Figure 4 for Resona: Improving Context Copying in Linear Recurrence Models with Retrieval
Viaarxiv icon