Article; Proceedings Paper

Use of Random forest in the identification of important variables

Journal

MICROCHEMICAL JOURNAL
Volume 145, Pages 1129-1134

Publisher

ELSEVIER
DOI: 10.1016/j.microc.2018.12.028

Keywords

Crude oil; H-1 NMR; C-13 NMR; Variables selection; Random Forest

Funding

  1. FAPES [33530.503.20537.12092017]
  2. CAPES
  3. CNPq [422515/2016-7]

The Random Forest (RF) technique has shown promise for supervised classification across different matrices. However, approaches for identifying the significant variables that carry the most weight in an RF classification model are scarce. In this paper, we propose a methodology for selecting the most relevant variables in the construction of RF models. To apply this methodology, classification models were developed to discriminate crude oil samples according to their maximum pour point (MPP). To this end, MPP data (ASTM D5853) were acquired for 105 crude oil samples, together with their hydrogen (H-1) and carbon (C-13) NMR spectra. With MPP ranging from -54 °C to 39 °C, two classes were assigned: the first containing 43 samples with MPP <= -9 °C, and the second containing 62 samples with MPP > -9 °C. The H-1 NMR model, with 90% accuracy, and the C-13 NMR model, with 71% accuracy, were used in the variable selection method. The results showed that the proposed methodology was effective in distinguishing the variables that contributed most to the discrimination of the oils. This new tool therefore enabled a greater understanding of the chemical information of interest contained in the spectra and its relationship with the MPP of the crude oil samples.
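The abstract does not spell out the selection procedure itself, so the following is only a minimal sketch of the general idea: assign the two MPP classes at the -9 °C threshold, fit an RF classifier, and rank variables by importance. It assumes scikit-learn and uses its built-in impurity-based importance, which is not necessarily the measure used in the paper. The data are synthetic stand-ins for spectral variables, and the names `assign_class`, `rank_variables`, and `informative` are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier


def assign_class(mpp_celsius, threshold=-9.0):
    """Return 0 for MPP <= threshold and 1 for MPP > threshold."""
    return (np.asarray(mpp_celsius) > threshold).astype(int)


def rank_variables(X, y, n_trees=500, seed=0):
    """Fit a random forest and return it with variable indices sorted
    from most to least important (impurity-based importance)."""
    forest = RandomForestClassifier(
        n_estimators=n_trees, oob_score=True, random_state=seed
    )
    forest.fit(X, y)
    order = np.argsort(forest.feature_importances_)[::-1]
    return forest, order


# Synthetic stand-in data (assumption): 105 samples, 20 "spectral" variables.
rng = np.random.default_rng(0)
n_samples, n_vars = 105, 20
mpp = rng.uniform(-54.0, 39.0, size=n_samples)  # MPP range from the abstract
y = assign_class(mpp)

X = rng.normal(size=(n_samples, n_vars))
informative = [3, 7]  # hypothetical variables that carry class information
X[:, informative] += 3.0 * y[:, None]

forest, order = rank_variables(X, y)
top_two = sorted(order[:2].tolist())
print("Out-of-bag accuracy:", round(forest.oob_score_, 2))
print("Two highest-ranked variables:", top_two)
```

With real spectra, each column of `X` would correspond to a chemical shift region, so the highest-ranked indices point back to the spectral regions that drive the classification.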
