Revisiting Calibration for Question Answering


May 25, 2022
Chenglei Si, Chen Zhao, Sewon Min, Jordan Boyd-Graber

* Preprint; Feedback is welcome 


Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP


Dec 20, 2021
Sabrina J. Mielke, Zaid Alyafeai, Elizabeth Salesky, Colin Raffel, Manan Dey, Matthias Gallé, Arun Raja, Chenglei Si, Wilson Y. Lee, Benoît Sagot, Samson Tan

* 15 page preprint 


What's in a Name? Answer Equivalence For Open-Domain Question Answering


Sep 11, 2021
Chenglei Si, Chen Zhao, Jordan Boyd-Graber

* EMNLP 2021 main conference 


Adversarial Training for Machine Reading Comprehension with Virtual Embeddings


Jun 08, 2021
Ziqing Yang, Yiming Cui, Chenglei Si, Wanxiang Che, Ting Liu, Shijin Wang, Guoping Hu

* Accepted to *SEM 2021 workshop at ACL 2021 


SHUOWEN-JIEZI: Linguistically Informed Tokenizers For Chinese Language Model Pretraining


Jun 01, 2021
Chenglei Si, Zhengyan Zhang, Yingfa Chen, Fanchao Qi, Xiaozhi Wang, Zhiyuan Liu, Maosong Sun

* Work in progress. Feedback is welcome 


Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning


Dec 31, 2020
Chenglei Si, Zhengyan Zhang, Fanchao Qi, Zhiyuan Liu, Yasheng Wang, Qun Liu, Maosong Sun

* 9 pages 


CharBERT: Character-aware Pre-trained Language Model


Nov 03, 2020
Wentao Ma, Yiming Cui, Chenglei Si, Ting Liu, Shijin Wang, Guoping Hu

* 12 pages, to appear at COLING 2020 


Benchmarking Robustness of Machine Reading Comprehension Models


Apr 29, 2020
Chenglei Si, Ziqing Yang, Yiming Cui, Wentao Ma, Ting Liu, Shijin Wang

* Work in progress 


What does BERT Learn from Multiple-Choice Reading Comprehension Datasets?


Oct 28, 2019
Chenglei Si, Shuohang Wang, Min-Yen Kan, Jing Jiang

* 10 pages 
