4.6 Article

Data driven identification of international cutting edge science and technologies using SpaCy

Journal

PLOS ONE
Volume 17, Issue 10, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pone.0275872

Keywords

-

Funding

  1. National Social Science Fund of China [20BTQ056]
  2. Gong (Huaping Gong)
  3. National Natural Science Foundation of China [72163021]

Ask authors/readers for more resources

This paper proposes a data-driven model for identifying global cutting-edge science technologies, and the experimental results show that this model performs well in entity recognition tasks. The model provides an information source for cutting-edge technology identification and promotes innovation and exploration of more efficient scientific and technological research work modes.
Difficulties in collecting, processing, and identifying massive data have slowed research on cutting-edge science and technology hotspots. Promoting these technologies will not be successful without an effective data-driven method to identify cutting-edge technologies. This paper proposes a data-driven model for identifying global cutting-edge science technologies based on SpaCy. In this model, we collected data released by 17 well-known American technology media websites from July 2019 to July 2020 using web crawling with Python. We combine graph-based neural network learning with active learning as the research method in this paper. Next, we introduced a ten-fold cross-check to train the model through machine learning with repeated experiments. The experimental results show that this model performed very well in entity recognition tasks with an F value of 98.11%. The model provides an information source for cutting-edge technology identification. It can promote innovations in cutting-edge technologies through its effective identification and tracking and explore more efficient scientific and technological research work modes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available