期刊
MACHINE LEARNING
卷 101, 期 1-3, 页码 325-343出版社
SPRINGER
DOI: 10.1007/s10994-014-5452-1
关键词
Bias correction; Random forests; Quantile regression forests; High dimensional data; Data mining
资金
- Shenzhen New Industry Development Fund [JC201005270342A]
- project Some Advanced Statistical Learning Techniques for Computer Vision - National Foundation of Science and Technology Development, Vietnam [102.01-2011.17]
Quantile regression forests (QRF), a tree-based ensemble method for estimation of conditional quantiles, has been proven to perform well in terms of prediction accuracy, especially for range prediction. However, the model may have bias and suffer from working with high dimensional data (thousands of features). In this paper, we propose a new bias correction method, called bcQRF that uses bias correction in QRF for range prediction. In bcQRF, a new feature weighting subspace sampling method is used to build the first level QRF model. The residual term of the first level QRF model is then used as the response feature to train the second level QRF model for bias correction. The two-level models are used to compute bias-corrected predictions. Extensive experiments on both synthetic and real world data sets have demonstrated that the bcQRF method significantly reduced prediction errors and outperformed most existing regression random forests. The new method performed especially well on high dimensional data.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据