4.7 Article

Research on the Chinese Named-Entity-Relation-Extraction Method for Crop Diseases Based on BERT

Journal

AGRONOMY-BASEL
Volume 12, Issue 9, Pages -

Publisher

MDPI
DOI: 10.3390/agronomy12092130

Keywords

knowledge graph; crop disease; named-entity recognition; relation extraction; BERT; PERT

Funding

  1. National Key Research and Development Program of China [2020YFD1100204]
  2. Natural Science Foundation of Hebei province [F2022204004]
  3. Hebei Province Key Research and Development Program [20327402D, 19227210D]

Ask authors/readers for more resources

An entity-relation-extraction model for crop diseases (BBCPF) was proposed in this paper, utilizing the advantages of knowledge graphs. Through the use of BERT, BiLSTM, and CRF models, the precision and recall values were optimized for entity and relation extraction, providing an effective method for knowledge graph construction in the Chinese crop disease domain.
In order to integrate fragmented text data of crop disease knowledge to solve the current problems of disordered knowledge management, weak correlation and difficulty in knowledge sharing, a Chinese named-entity-relation-extraction model for crop diseases (BBCPF) was proposed in this paper by utilizing the advantage of knowledge graph in describing complex relations between disease entities in a structured form. This model was composed of two parts, i.e., named-entity recognition and relation extraction, in the form of an assembly line. To deal with the different meanings of Chinese crop disease terms in different contexts and to better obtain the contextual information, the BERT model was introduced for dynamic vector representations. Then, the BiLSTM layer was used to learn long-distance text information, and the CRF was applied to obtain the globally optimal labeling sequence, so as to output the crop disease entities. According to the entity category, the entities were divided as subjects and objects, which were then input into the disordered language model PERT to extract the contextual features of the relation data. At last, the fully connected layer was used to decode the information and output the crop disease entity-relation triples. The experiment results show that, on the self-built disease corpus dataset, the Precision, Recall, and Fl-Score values of the established model reached 85.63%, 79.46% and 82.43%, respectively, for entity extraction, and reached 97.96%, 98.43% and 98.16%, respectively, for relation extraction. This paper provides an effective method for information extraction in the construction of Chinese crop disease domain knowledge graphs.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available