☆ 4.6 Article

Automated retrieval of information on threatened species from online sources using machine learning

METHODS IN ECOLOGY AND EVOLUTION (2021)

期刊

METHODS IN ECOLOGY AND EVOLUTION

卷 12, 期 7, 页码 1226-1239

出版社

WILEY

DOI: 10.1111/2041-210X.13608

关键词

biodiversity; digital conservation; machine learning; natural language processing; wildlife trade

类别

Ecology

资金

European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme [802933]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The study demonstrates the successful application of natural language processing to extract information from digital text content, showcasing the potential for investigating human-nature interactions in conservation science and practice. The automated methods developed can be applied to multiple digital data platforms simultaneously, offering a cost-efficient and effective approach to addressing global biodiversity crisis.

1. As resources for conservation are limited, gathering and analysing information from digital platforms can help investigate the global biodiversity crisis in a cost-efficient manner. Development and application of methods for automated content analysis of digital data sources are especially important in the context of investigating human-nature interactions. 2. In this study, we introduce novel application methods to automatically collect and analyse textual data on species of conservation concern from digital platforms. An end-to-end pipeline is constructed that begins from searching and downloading news articles about species listed in Appendix I of the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES) along with news articles from specific Twitter handles and proceeds with implementing natural language processing and machine learning methods to filter and retain only relevant articles. A crucial aspect here is the automatic annotation of training data, which can be challenging in many machine learning applications. A Named Entity Recognition model is then used to extract additional relevant information for each article. 3. The data collected over a 1-month period included 15,088 articles focusing on 585 species listed in Appendix I of CITES. The accuracy of the neural network to detect relevant articles was 95.91% while the Named Entity recognition model helped extract information on prices, location and quantities of traded animals and plants. A regularly updated database, which can be queried and analysed for various research purposes and to inform conservation decision making, is generated by the system. 4. The results demonstrate that natural language processing can be used successfully to extract information from digital text content. The proposed methods can be applied to multiple digital data platforms at the same time and used to investigate human-nature interactions in conservation science and practice.

Automated retrieval of information on threatened species from online sources using machine learning

期刊

METHODS IN ECOLOGY AND EVOLUTION

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Automated retrieval of information on threatened species from online sources using machine learning

期刊

METHODS IN ECOLOGY AND EVOLUTION

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文