期刊
BIODATA MINING
卷 7, 期 -, 页码 -出版社
BMC
DOI: 10.1186/s13040-014-0028-y
关键词
Machine; Nodesize; Random forest; Trees; Synthetic feature
资金
- National Science Foundation [1148991]
- National Cancer Institute [R01CA163739]
- Intramural Research Program at the National Institutes of Health
- Direct For Mathematical & Physical Scien
- Division Of Mathematical Sciences [1148991] Funding Source: National Science Foundation
Background: Using a collection of different terminal nodesize constructed random forests, each generating a synthetic feature, a synthetic random forest is defined as a kind of hyperforest, calculated using the new input synthetic features, along with the original features. Results: Using a large collection of regression and multiclass datasets we show that synthetic random forests outperforms both conventional random forests and the optimized forest from the regresssion portfolio. Conclusions: Synthetic forests removes the need for tuning random forests with no additional effort on the part of the researcher. Importantly, the synthetic forest does this with evidently no loss in prediction compared to a well-optimized single random forest.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据