Picture for Zewen Chi

Zewen Chi

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

Add code
Mar 08, 2026
Viaarxiv icon

SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity

Add code
Mar 05, 2026
Viaarxiv icon

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

Add code
Mar 05, 2026
Viaarxiv icon

VIBEVOICE-ASR Technical Report

Add code
Jan 26, 2026
Viaarxiv icon

Black-Box On-Policy Distillation of Large Language Models

Add code
Nov 13, 2025
Viaarxiv icon

The Era of Agentic Organization: Learning to Organize with Language Models

Add code
Oct 30, 2025
Figure 1 for The Era of Agentic Organization: Learning to Organize with Language Models
Figure 2 for The Era of Agentic Organization: Learning to Organize with Language Models
Figure 3 for The Era of Agentic Organization: Learning to Organize with Language Models
Figure 4 for The Era of Agentic Organization: Learning to Organize with Language Models
Viaarxiv icon

Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts

Add code
Oct 27, 2025
Viaarxiv icon

On-Policy RL with Optimal Reward Baseline

Add code
May 29, 2025
Viaarxiv icon

Think Only When You Need with Large Hybrid-Reasoning Models

Add code
May 21, 2025
Viaarxiv icon

Reward Reasoning Model

Add code
May 20, 2025
Figure 1 for Reward Reasoning Model
Figure 2 for Reward Reasoning Model
Figure 3 for Reward Reasoning Model
Figure 4 for Reward Reasoning Model
Viaarxiv icon