Furu Wei

MathScale: Scaling Instruction Tuning for Mathematical Reasoning

Mar 05, 2024

Towards Optimal Learning of Language Models

Mar 03, 2024

ResLoRA: Identity Residual Mapping in Low-Rank Adaption

Feb 28, 2024

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Feb 27, 2024

Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models

Feb 26, 2024

HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition

Feb 24, 2024

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Feb 20, 2024

Text Diffusion with Reinforced Conditioning

Feb 19, 2024

Generative Representational Instruction Tuning

Feb 15, 2024

Multilingual E5 Text Embeddings: A Technical Report

Feb 08, 2024