Picture for Furu Wei

Furu Wei

Reinforcement Pre-Training

Add code
Jun 09, 2025
Figure 1 for Reinforcement Pre-Training
Figure 2 for Reinforcement Pre-Training
Figure 3 for Reinforcement Pre-Training
Figure 4 for Reinforcement Pre-Training
Viaarxiv icon

Rectified Sparse Attention

Add code
Jun 05, 2025
Viaarxiv icon

On-Policy RL with Optimal Reward Baseline

Add code
May 29, 2025
Viaarxiv icon

Think Only When You Need with Large Hybrid-Reasoning Models

Add code
May 21, 2025
Viaarxiv icon

Reward Reasoning Model

Add code
May 20, 2025
Figure 1 for Reward Reasoning Model
Figure 2 for Reward Reasoning Model
Figure 3 for Reward Reasoning Model
Figure 4 for Reward Reasoning Model
Viaarxiv icon

Efficient RL Training for Reasoning Models via Length-Aware Optimization

Add code
May 18, 2025
Viaarxiv icon

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Add code
Apr 25, 2025
Figure 1 for BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs
Figure 2 for BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs
Figure 3 for BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs
Figure 4 for BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs
Viaarxiv icon

A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Add code
Apr 21, 2025
Viaarxiv icon

BitNet b1.58 2B4T Technical Report

Add code
Apr 16, 2025
Figure 1 for BitNet b1.58 2B4T Technical Report
Figure 2 for BitNet b1.58 2B4T Technical Report
Figure 3 for BitNet b1.58 2B4T Technical Report
Figure 4 for BitNet b1.58 2B4T Technical Report
Viaarxiv icon

Model as a Game: On Numerical and Spatial Consistency for Generative Games

Add code
Mar 27, 2025
Viaarxiv icon