4.6 Article

Global Flood Disaster Research Graph Analysis Based on Literature Mining

Journal

APPLIED SCIENCES-BASEL
Volume 12, Issue 6, Pages -

Publisher

MDPI
DOI: 10.3390/app12063066

Keywords

flood disaster; research hotspot; literature mining; natural language processing; knowledge graph

Funding

  1. National Natural Science Foundation of China [42050105]
  2. Chinese Academy of Sciences [ZDRW-XH-2021-3]
  3. Construction Project of the China Knowledge Center for Engineering Sciences and Technology [CKCEST-2021-2-18]

Ask authors/readers for more resources

This study utilizes natural language process technology to mine research trends and hotspots on flood disasters, and analyzes their quantitative and spatial distribution features. The research shows that the current focus of flood disaster research is on risk and prediction, and the global and intercontinental distribution of research is geographically imbalanced. A flood disaster knowledge graph has been constructed to provide more information on flood disaster risk and reduction.
Floods are the most frequent and highest-impact among the natural disasters caused by global climate change. A large number of flood disaster knowledge were buried in the scientific literature. This study mines research trends and hotspots on flood disasters and identifies their quantitative and spatial distribution features using natural language process technology. The abstracts of 14,076 studies related to flood disasters from 1990 to 2020 were used for text mining. The study used logistic regression to classify themes, adopted the dictionary matching method to analyze flood disaster subcategories, analyzed the spatial distribution characteristics of research institutions, and used Stanford named entity recognition to identify hot research areas. Finally, the disaster information was integrated and visualized as a knowledge graph. The main findings are as follows. (1) The research hotspots are concentrated on flood disaster risks and prediction. Rainfall, coastal floods, and flash floods are the most-studied flood disaster sub-categories. (2) There are some connections and differences between the physical occurrence and research frequency of flood disasters. Occurrence frequency and research frequency of flood disasters are correlated. However, the spatial distribution at the global and intercontinental scales is geographically imbalanced. (3) The study's flood disaster knowledge graph contains 39,679 nodes and 64,908 edges, reflecting the literature distribution and field information on the research themes. Future research will extract more disaster information from the full texts of the studies to enrich the flood disaster knowledge graph and obtain more knowledge on flood disaster risk and reduction.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available