Picture for Zhiyuan Zeng

Zhiyuan Zeng

Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models

Add code
May 21, 2024
Viaarxiv icon

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

Add code
Mar 12, 2024
Figure 1 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 2 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 3 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 4 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Viaarxiv icon

Turn Waste into Worth: Rectifying Top-$k$ Router of MoE

Add code
Feb 21, 2024
Viaarxiv icon

Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora

Add code
Jan 26, 2024
Figure 1 for Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora
Figure 2 for Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora
Figure 3 for Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora
Figure 4 for Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora
Viaarxiv icon

Evaluating Large Language Models at Evaluating Instruction Following

Add code
Oct 11, 2023
Figure 1 for Evaluating Large Language Models at Evaluating Instruction Following
Figure 2 for Evaluating Large Language Models at Evaluating Instruction Following
Figure 3 for Evaluating Large Language Models at Evaluating Instruction Following
Figure 4 for Evaluating Large Language Models at Evaluating Instruction Following
Viaarxiv icon

Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Add code
Oct 10, 2023
Viaarxiv icon

Emergent Modularity in Pre-trained Transformers

Add code
May 28, 2023
Figure 1 for Emergent Modularity in Pre-trained Transformers
Figure 2 for Emergent Modularity in Pre-trained Transformers
Figure 3 for Emergent Modularity in Pre-trained Transformers
Figure 4 for Emergent Modularity in Pre-trained Transformers
Viaarxiv icon

Plug-and-Play Knowledge Injection for Pre-trained Language Models

Add code
May 28, 2023
Figure 1 for Plug-and-Play Knowledge Injection for Pre-trained Language Models
Figure 2 for Plug-and-Play Knowledge Injection for Pre-trained Language Models
Figure 3 for Plug-and-Play Knowledge Injection for Pre-trained Language Models
Figure 4 for Plug-and-Play Knowledge Injection for Pre-trained Language Models
Viaarxiv icon

KNIFE: Knowledge Distillation with Free-Text Rationales

Add code
Dec 19, 2022
Figure 1 for KNIFE: Knowledge Distillation with Free-Text Rationales
Figure 2 for KNIFE: Knowledge Distillation with Free-Text Rationales
Figure 3 for KNIFE: Knowledge Distillation with Free-Text Rationales
Figure 4 for KNIFE: Knowledge Distillation with Free-Text Rationales
Viaarxiv icon

Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning

Add code
Oct 17, 2022
Figure 1 for Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning
Figure 2 for Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning
Figure 3 for Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning
Figure 4 for Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning
Viaarxiv icon