Title: Neural CRF Model for Sentence Alignment in Text Simplification

May 18, 2020

Chao Jiang, Mounica Maddela, Wuwei Lan, Yang Zhong, Wei Xu

Abstract: The success of a text simplification system depends heavily on the quality and quantity of complex-simple sentence pairs in the training corpus, which are extracted by aligning sentences between parallel articles. To evaluate and improve sentence alignment quality, we create two manually annotated sentence-aligned datasets from two commonly used text simplification corpora, Newsela and Wikipedia. We propose a novel neural CRF alignment model which not only leverages the sequential nature of sentences in parallel documents but also utilizes a neural sentence pair model to capture semantic similarity. Experiments demonstrate that our proposed approach outperforms all previous work on the monolingual sentence alignment task by more than 5 points in F1. We apply our CRF aligner to construct two new text simplification datasets, Newsela-Auto and Wiki-Auto, which are much larger and of better quality than the existing datasets. A Transformer-based seq2seq model trained on our datasets establishes a new state-of-the-art for text simplification in both automatic and human evaluation.

* The paper has been accepted to ACL 2020
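
The abstract describes an aligner that combines a sentence-pair similarity score with the sequential order of sentences in parallel documents. The following is only a minimal sketch of that general idea, not the authors' model: it runs Viterbi decoding over a linear chain in which each simple sentence is labeled with the complex sentence it aligns to (or with "unaligned"). The word-overlap similarity `jaccard` is a toy stand-in for a neural sentence pair model, and the function names, `null_score`, and `jump_penalty` are hypothetical choices made for illustration.

```python
from typing import List


def jaccard(a: str, b: str) -> float:
    """Toy word-overlap similarity (assumption: stands in for a neural scorer)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def viterbi_align(simple: List[str], complex_: List[str],
                  null_score: float = 0.2,
                  jump_penalty: float = 0.1) -> List[int]:
    """For each simple sentence, return the index of the complex sentence it
    aligns to, or -1 if it is left unaligned.

    Label 0 means "unaligned"; label j (1..m) means complex sentence j-1.
    Both hyperparameters are illustrative assumptions, not published values.
    """
    if not simple:
        return []
    labels = range(len(complex_) + 1)

    def emit(i: int, label: int) -> float:
        # Emission score: similarity of the candidate pair, or a flat score
        # for leaving the simple sentence unaligned.
        if label == 0:
            return null_score
        return jaccard(simple[i], complex_[label - 1])

    def trans(prev: int, cur: int) -> float:
        # Transition score: penalize large jumps between consecutive
        # alignments; no penalty when either side is unaligned.
        if prev == 0 or cur == 0:
            return 0.0
        return -jump_penalty * abs(cur - prev)

    score = [emit(0, label) for label in labels]
    backpointers = []
    for i in range(1, len(simple)):
        new_score, ptr = [], []
        for cur in labels:
            best_prev = max(labels, key=lambda p: score[p] + trans(p, cur))
            new_score.append(score[best_prev] + trans(best_prev, cur) + emit(i, cur))
            ptr.append(best_prev)
        score = new_score
        backpointers.append(ptr)

    # Follow back-pointers from the best final label to recover the path.
    path = [max(labels, key=lambda label: score[label])]
    for ptr in reversed(backpointers):
        path.append(ptr[path[-1]])
    path.reverse()
    return [label - 1 for label in path]


if __name__ == "__main__":
    complex_doc = [
        "the cat sat on the mat",
        "dogs bark loudly at night",
        "the weather was extremely cold today",
    ]
    simple_doc = ["the cat sat", "dogs bark at night", "i like pizza"]
    print(viterbi_align(simple_doc, complex_doc))
```

In this sketch the similarity function and the transition penalty are hand-set; in a trained CRF both would be learned from annotated alignments such as the manually labeled datasets the abstract mentions.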
