Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SkillNet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach

Apr 26, 2022

Junwei Liao, Duyu Tang, Fan Zhang, Shuming Shi

Figure 1 for SkillNet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach

Figure 2 for SkillNet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach

Figure 3 for SkillNet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach

Figure 4 for SkillNet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach

Share this with someone who'll enjoy it:

Abstract:We present SkillNet-NLG, a sparsely activated approach that handles many natural language generation tasks with one model. Different from traditional dense models that always activate all the parameters, SkillNet-NLG selectively activates relevant parts of the parameters to accomplish a task, where the relevance is controlled by a set of predefined skills. The strength of such model design is that it provides an opportunity to precisely adapt relevant skills to learn new tasks effectively. We evaluate on Chinese natural language generation tasks. Results show that, with only one model file, SkillNet-NLG outperforms previous best performance methods on four of five tasks. SkillNet-NLG performs better than two multi-task learning baselines (a dense model and a Mixture-of-Expert model) and achieves comparable performance to task-specific models. Lastly, SkillNet-NLG surpasses baseline systems when being adapted to new tasks.

* 8 pages,3 figures

View paper on

Share this with someone who'll enjoy it:

Title:SkillNet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach

Paper and Code