Picture for Furu Wei

Furu Wei

Reasoning with Exploration: An Entropy Perspective

Add code
Jun 17, 2025
Viaarxiv icon

Reinforcement Pre-Training

Add code
Jun 09, 2025
Viaarxiv icon

Rectified Sparse Attention

Add code
Jun 05, 2025
Viaarxiv icon

On-Policy RL with Optimal Reward Baseline

Add code
May 29, 2025
Viaarxiv icon

Think Only When You Need with Large Hybrid-Reasoning Models

Add code
May 21, 2025
Viaarxiv icon

Reward Reasoning Model

Add code
May 20, 2025
Viaarxiv icon

Efficient RL Training for Reasoning Models via Length-Aware Optimization

Add code
May 18, 2025
Viaarxiv icon

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Add code
Apr 25, 2025
Viaarxiv icon

A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Add code
Apr 21, 2025
Viaarxiv icon

BitNet b1.58 2B4T Technical Report

Add code
Apr 16, 2025
Viaarxiv icon