Picture for Furu Wei

Furu Wei

VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models

Add code
Aug 13, 2025
Viaarxiv icon

Geometric-Mean Policy Optimization

Add code
Jul 28, 2025
Viaarxiv icon

Reasoning with Exploration: An Entropy Perspective

Add code
Jun 17, 2025
Viaarxiv icon

Reinforcement Pre-Training

Add code
Jun 09, 2025
Viaarxiv icon

Rectified Sparse Attention

Add code
Jun 05, 2025
Viaarxiv icon

On-Policy RL with Optimal Reward Baseline

Add code
May 29, 2025
Viaarxiv icon

Think Only When You Need with Large Hybrid-Reasoning Models

Add code
May 21, 2025
Viaarxiv icon

Reward Reasoning Model

Add code
May 20, 2025
Viaarxiv icon

Efficient RL Training for Reasoning Models via Length-Aware Optimization

Add code
May 18, 2025
Viaarxiv icon

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Add code
Apr 25, 2025
Viaarxiv icon