Furu Wei

Reward Reasoning Model

May 20, 2025
Via arXiv

Efficient RL Training for Reasoning Models via Length-Aware Optimization

May 18, 2025
Via arXiv

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Apr 25, 2025
Via arXiv

A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Apr 21, 2025
Via arXiv

BitNet b1.58 2B4T Technical Report

Apr 16, 2025
Via arXiv

Model as a Game: On Numerical and Spatial Consistency for Generative Games

Mar 27, 2025
Via arXiv

Scaling Laws of Synthetic Data for Language Models

Mar 26, 2025
Via arXiv

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

Feb 25, 2025
Via arXiv

WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale

Feb 23, 2025
Via arXiv

Bitnet.cpp: Efficient Edge Inference for Ternary LLMs

Feb 17, 2025
Via arXiv