Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Daniel Cheung

Lex-BERT: Enhancing BERT based NER with lexicons

Jan 02, 2021

Wei Zhu, Daniel Cheung

Figure 1 for Lex-BERT: Enhancing BERT based NER with lexicons

Figure 2 for Lex-BERT: Enhancing BERT based NER with lexicons

Figure 3 for Lex-BERT: Enhancing BERT based NER with lexicons

Abstract:In this work, we represent Lex-BERT, which incorporates the lexicon information into Chinese BERT for named entity recognition (NER) tasks in a natural manner. Instead of using word embeddings and a newly designed transformer layer as in FLAT, we identify the boundary of words in the sentences using special tokens, and the modified sentence will be encoded directly by BERT. Our model does not introduce any new parameters and are more efficient than FLAT. In addition, we do not require any word embeddings accompanying the lexicon collection. Experiments on Ontonotes and ZhCrossNER show that our model outperforms FLAT and other baselines.

Via

Access Paper or Ask Questions

CMV-BERT: Contrastive multi-vocab pretraining of BERT

Dec 29, 2020

Wei Zhu, Daniel Cheung

Figure 1 for CMV-BERT: Contrastive multi-vocab pretraining of BERT

Figure 2 for CMV-BERT: Contrastive multi-vocab pretraining of BERT

Abstract:In this work, we represent CMV-BERT, which improves the pretraining of a language model via two ingredients: (a) contrastive learning, which is well studied in the area of computer vision; (b) multiple vocabularies, one of which is fine-grained and the other is coarse-grained. The two methods both provide different views of an original sentence, and both are shown to be beneficial. Downstream tasks demonstrate our proposed CMV-BERT are effective in improving the pretrained language models.

Via

Access Paper or Ask Questions