Yuxian Gu

Direct Preference Knowledge Distillation for Large Language Models

Jun 28, 2024
Via arXiv

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Jun 20, 2024
Via arXiv

Towards Optimal Learning of Language Models

Mar 03, 2024
Via arXiv

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Feb 20, 2024
Via arXiv

Knowledge Distillation of Large Language Models

Jun 14, 2023
Via arXiv

Pre-Training to Learn in Context

May 16, 2023
Via arXiv

Structured Prompting: Scaling In-Context Learning to 1,000 Examples

Dec 13, 2022
Via arXiv

Learning Instructions with Unlabeled Data for Zero-Shot Cross-Task Generalization

Oct 17, 2022
Via arXiv

Many-Class Text Classification with Matching

May 23, 2022
Via arXiv

EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training

Mar 17, 2022
Via arXiv