Alert button

InstructIE: A Chinese Instruction-based Information Extraction Dataset

May 19, 2023
Honghao Gui, Jintian Zhang, Hongbin Ye, Ningyu Zhang

Figure 1 for InstructIE: A Chinese Instruction-based Information Extraction Dataset
Figure 2 for InstructIE: A Chinese Instruction-based Information Extraction Dataset
Figure 3 for InstructIE: A Chinese Instruction-based Information Extraction Dataset
Figure 4 for InstructIE: A Chinese Instruction-based Information Extraction Dataset

Share this with someone who'll enjoy it:

We introduce a new Information Extraction (IE) task dubbed Instruction-based IE, which aims to ask the system to follow specific instructions or guidelines to extract information. To facilitate research in this area, we construct a dataset called InstructIE, consisting of 270,000 weakly supervised data from Chinese Wikipedia and 1,000 high-quality crowdsourced annotated instances. We further evaluate the performance of various baseline models on the InstructIE dataset. The results reveal that although current models exhibit promising performance, there is still room for improvement. Furthermore, we conduct a comprehensive case study analysis, underlining the challenges inherent in the Instruction-based IE task. Code and dataset are available at https://github.com/zjunlp/DeepKE/tree/main/example/llm.

* Work in progress  
View paper onarxiv icon

Share this with someone who'll enjoy it: