☆ 4.5 Article

Efficient ensembles of distance-based label ranking trees

EXPERT SYSTEMS (2023)

期刊

EXPERT SYSTEMS

卷 -, 期 -, 页码 -

出版社

WILEY

DOI: 10.1111/exsy.13525

关键词

ensemble methods; generalized Kendall distance; label ranking; machine learning; preference learning

类别

Computer Science, Artificial Intelligence Computer Science, Theory & Methods

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This article proposes two alternative methods to improve the label ranking trees (LRTs) algorithm. These methods use distance-based criteria to select the best split at each node and can handle incomplete rankings efficiently. Experimental results show that the proposed methods are significantly faster and at least as accurate as the original Mallows-based LRT algorithm.

Ensemble of label ranking trees (LRTs) are currently the state-of-the-art approaches to the label ranking problem. Recently, bagging, boosting, and random forest methods have been proposed, all based on the LRT algorithm, which adapts regression/classification trees to the label classification problem. The LRT algorithm uses theoretically grounded Mallows probability distribution to select the best split when growing the tree, and an EM-type process to complete the rankings on the training data when they are incomplete. These two steps have proven to be accurate, but require a large computational effort. This article proposes two alternative methods that replace the use of the Mallows distribution with distance-based criteria to select the best split at each inner node of the tree. Moreover, these distance-based criteria allow dealing with incomplete rankings natively, so avoiding the completion process. We have carried out an extensive experimental evaluation, which shows that (1) the integration of the two proposed modifications to the LRT algorithm into ensemble methods (bagging and random forest) are an order of magnitude faster than using the original Mallows-based LRT algorithm; (2) ensembles using the proposed LRT methods are significantly more accurate in the presence of incomplete rankings, while they are at least as accurate in the complete case; and (3) the two modified LRT algorithms are also an order of magnitude faster than the Mallows-based LRT, while they are at least as accurate as the Mallows-based LRT on both complete and incomplete rankings.

Efficient ensembles of distance-based label ranking trees

期刊

EXPERT SYSTEMS

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Efficient ensembles of distance-based label ranking trees

期刊

EXPERT SYSTEMS

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文