Article

Machine vision automated species identification scaled towards production levels

Journal

SYSTEMATIC ENTOMOLOGY
Volume 41, Issue 1, Pages 133-143

Publisher

WILEY
DOI: 10.1111/syen.12146

Computer-automated identification of insect species has long been sought to support activities such as environmental monitoring, forensics, pest diagnostics, border security and vector epidemiology, to name just a few. In order to succeed, an automated identification programme capable of addressing the needs of the end user should be able to classify hundreds of taxa, if not thousands, and is expected to distinguish closely related and hence morphologically similar species. However, it remains unknown how automated identification methods might handle an increase in data quantity, be it in reference imagery or taxonomic diversity. We sought to test the scalability of an automated identification method in terms of the number of reference specimens used to train the classifier and the number of taxa into which the classifier should assign unknown specimens. Is there an optimal number of reference images, where the cost of acquiring more images becomes greater than the marginal increase in identification success? Does increasing taxonomic diversity affect identification success, whether negatively or positively? In order to test the scalability of the automated insect identification enterprise, we used a sparse processing technique and support vector machine to test the largest dataset to date: 72 species of fruit flies (Diptera: Tephritidae) and 76 species of mosquitoes (Diptera: Culicidae). We found that: (i) machine vision methods are capable of correctly classifying large numbers of closely related species; (ii) when the misclassification of a specimen occurs at the species level, it is often classified in the correct genus; (iii) classification success increases asymptotically as new training images are added to the dataset; (iv) broad taxon sampling outside a focal group can increase classification success within it.
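
Finding (ii) above compares classification success at two taxonomic ranks. The sketch below is one illustrative way to compute that comparison, not the authors' evaluation code: it assumes each label is a binomial name whose first word is the genus, and the helper names (`genus_of`, `accuracy_by_rank`) and the example labels are hypothetical, invented for illustration rather than taken from the paper's dataset.

```python
def genus_of(binomial: str) -> str:
    """Return the genus, assumed to be the first word of a binomial name."""
    return binomial.split()[0]


def accuracy_by_rank(true_labels, predicted_labels):
    """Return (species_accuracy, genus_accuracy) for two paired label lists."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("label lists must not be empty")

    pairs = list(zip(true_labels, predicted_labels))
    # A species-level hit requires the full binomial name to match.
    species_hits = sum(t == p for t, p in pairs)
    # A genus-level hit only requires the genus part to match.
    genus_hits = sum(genus_of(t) == genus_of(p) for t, p in pairs)
    return species_hits / len(pairs), genus_hits / len(pairs)


# Hypothetical labels for illustration only (not results from the paper).
truth = ["Aedes aegypti", "Aedes albopictus", "Culex pipiens", "Anopheles gambiae"]
preds = ["Aedes aegypti", "Aedes aegypti", "Culex pipiens", "Culex pipiens"]

species_acc, genus_acc = accuracy_by_rank(truth, preds)
print(f"species-level accuracy: {species_acc:.2f}")
print(f"genus-level accuracy:   {genus_acc:.2f}")
```

In this toy input the second specimen is assigned to the wrong species but the right genus, so the genus-level score is higher than the species-level score, which is the pattern finding (ii) describes.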
