☆ 4.7 Article

Improving predictive performance on survival in dairy cattle using an ensemble learning approach

COMPUTERS AND ELECTRONICS IN AGRICULTURE (2020)

期刊

COMPUTERS AND ELECTRONICS IN AGRICULTURE

卷 177, 期 -, 页码 -

出版社

ELSEVIER SCI LTD

DOI: 10.1016/j.compag.2020.105675

关键词

Ensemble; Machine learning; Survival; Dairy cow

类别

Agriculture, Multidisciplinary Computer Science, Interdisciplinary Applications

资金

Netherlands Organization for Scientific Research (NWO) [14295]
Dutch Ministry of Economic Affairs (TKI Agri Food project) [14295, 12018]
Cobb Europe [14295]
CRV [14295]
Hendrix Genetics [14295]
Topigs Norsvin [14295]
GenTORE from European Community's H2020 Framework Programme - GenTORE [727213]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Cow survival is a complex trait that combines traits like milk production, fertility, health and environmental factors such as farm management. This complexity makes survival difficult to predict accurately. This is probably the reason why few studies attempted to address this problem and no studies are published that use ensemble methods for this purpose. We explored if we could improve prediction of cow survival to second lactation, when predicted at five different moments in a cow's life, by combining the predictions of multiple (weak) methods in an ensemble method. We tested four ensemble methods: majority voting rule, multiple logistic regression, random forest and naive Bayes. Precision, recall, balanced accuracy, area under the curve (AUC) and gains in proportion of surviving cows in a scenario where the best 50% were selected were used to evaluate the ensemble model performance. We also calculated correlations between the ensemble models and obtained McNemar's test statistics. We compared the performance of the ensemble methods against those of the individual methods. We also tested if there was a difference in performance metrics when continuous (from 0 to 1) and binary (0 or 1) prediction outcomes were used. In general, using continuous prediction output resulted in higher performance metrics than binary ones. AUCs for models ranged from 0.561 to 0.731, with generally increasing performance at moments later in life. Precision, AUC and balanced accuracy values improved significantly for the naive Bayes and multiple logistic regression ensembles in at least one data set, although performance metrics did remain low overall. The multiple logistic regression ensemble method resulted in equal or better precision, AUC, balanced accuracy and proportion of animals surviving on all datasets and was significantly different from the other ensembles in three out of five moments. The random forest ensemble method resulted in the least significant improvement over the individual methods.

Improving predictive performance on survival in dairy cattle using an ensemble learning approach

期刊

COMPUTERS AND ELECTRONICS IN AGRICULTURE

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Improving predictive performance on survival in dairy cattle using an ensemble learning approach

期刊

COMPUTERS AND ELECTRONICS IN AGRICULTURE

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文