4.6 Article

Secure and Efficient Federated Gradient Boosting Decision Trees

期刊

APPLIED SCIENCES-BASEL
卷 13, 期 7, 页码 -

出版社

MDPI
DOI: 10.3390/app13074283

关键词

horizontal federated learning; data security; gradient boosting decision tree; privacy protection; LSH

向作者/读者索取更多资源

In recent years, federated GBDTs have replaced traditional GBDTs and become the focus of academic research for structured data mining. This paper proposes a secure and efficient FL algorithm for GBDTs (SeFB) based on horizontal federated learning, addressing the issues of information leakage, model accuracy, and communication cost. Experimental analysis shows that the algorithm protects data privacy and reduces communication cost effectively.
In recent years, federated GBDTs have gradually replaced traditional GBDTs, and become the focus of academic research. They are used to solve the task of structured data mining. Aiming at the problems of information leakage, insufficient model accuracy and high communication cost in the existing schemes of horizontal federated GBDTs, this paper proposes an algorithm of gradient boosting decision trees based on horizontal federated learning, that is, secure and efficient FL for GBDTs (SeFB). The algorithm uses locality sensitive hashing (LSH) to build a tree by collecting similar information of instances without exposing the original data of participants. In the stage of updating the tree, the algorithm aggregates the local gradients of all data participants and calculates the global leaf weights, so as to improve the accuracy of the model and reduce the communication cost. Finally, the experimental analysis shows that the algorithm can protect the privacy of the original data, and the communication cost is low. At the same time, the performance of the unbalanced binary data set is evaluated. The results show that SeFB algorithm compared with the existing schemes of horizontal federated GBDTs, the accuracy is improved by 2.53% on average.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据