4.7 Article

Efficiency of the t-distribution stochastic neighbor embedding technique for detailed visualization and modeling interactions between agricultural soil quality indicators

Journal

BIOSYSTEMS ENGINEERING
Volume 210, Pages 282-298

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.biosystemseng.2021.08.033

Keywords

Kohonen self-organizing map; neural network; Botswana; Arenosols; k-means clustering; t-SNE


The study compared the performance of KSOM-NN and t-SNE in characterizing soil quality indicators in agricultural dryland, with t-SNE showing superior visualization capabilities. Strong positive associations were found between some soil quality indicators, and the results have implications for agricultural soil management and decision-making. The study also provides preliminary findings for verifying detailed aspects of soil biogeochemistry in the future.
Dimensionality reduction can reveal details in multivariate data that are useful for decision-making. Although different dimensionality reduction methods have been applied in several soil-based studies, the Kohonen self-organizing map neural network (KSOM-NN) has attracted significant attention from researchers because of the quality of its data visualization and interpretation. However, few studies compare KSOM-NN with other robust data reduction techniques, such as t-distribution stochastic neighbor embedding (t-SNE), to improve visualization and interpretation of the relationships between soil quality indicators in agricultural soil. This study compares the two methods for characterizing soil quality indicators, including particle size distribution, soil organic matter (SOM), cation exchange capacity (CEC), soil reaction (pH), electrical conductivity (EC), zinc (Zn), iron (Fe), manganese (Mn), potassium (K) and phosphorus (P), in agricultural dryland. The correlation matrix showed strong positive associations between some of the variables studied, for example clay/Fe (r = 0.95), clay/SOM (r = 0.79) and Mn/Zn (r = 0.90). For the KSOM-NN, the best map size was 4 by 7, with quantization error (QE) = 0.108, topographic error (TE) = 0.875 and Kaski-Lagus error (K-LE) = 9.104. This map yielded only 2 main clusters. For t-SNE, applying various perplexity values (5, 6, 7, 8, 9 and 10) gave better visualization of the soil quality indicators than the KSOM-NN, as observed from the cluster formations. Ultimately, t-SNE was considered the better and more promising method for assessing the interactions of soil quality indicators, with the potential to visualise soil results appropriately and improve their interpretability by identifying essential features and similarities. The results of this study have useful implications for agricultural soil management and decision-making. Moreover, they are preliminary findings that may be used in the future to verify detailed aspects of soil biogeochemistry within the study area. (c) 2021 IAgrE. Published by Elsevier Ltd. All rights reserved.
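The abstract reports a quantization error (QE) and a topographic error (TE) for the selected 4 by 7 map without saying how they are computed. The sketch below uses the commonly cited definitions: QE as the mean distance from each sample to its best-matching unit, and TE as the share of samples whose two closest units are not neighbours on the grid. It is not the authors' code. The rectangular grid with an 8-neighbourhood, the function names, the toy codebook and the toy samples are all assumptions made for illustration, and the Kaski-Lagus error is not covered.

```python
import math
from itertools import product


def best_two_units(sample, codebook):
    """Return grid coordinates of the closest and second-closest units."""
    ranked = sorted(codebook, key=lambda pos: math.dist(sample, codebook[pos]))
    return ranked[0], ranked[1]


def quantization_error(samples, codebook):
    """Mean Euclidean distance from each sample to its best-matching unit."""
    total = 0.0
    for s in samples:
        bmu, _ = best_two_units(s, codebook)
        total += math.dist(s, codebook[bmu])
    return total / len(samples)


def topographic_error(samples, codebook):
    """Share of samples whose two closest units are not grid neighbours."""
    errors = 0
    for s in samples:
        (r1, c1), (r2, c2) = best_two_units(s, codebook)
        # Assumed rectangular grid with an 8-neighbourhood:
        # two units are adjacent when both row and column offsets are <= 1.
        if max(abs(r1 - r2), abs(c1 - c2)) > 1:
            errors += 1
    return errors / len(samples)


# Hypothetical 4 x 7 map whose unit weights are 2-D points (toy values only).
rows, cols = 4, 7
codebook = {(r, c): (float(r), float(c))
            for r, c in product(range(rows), range(cols))}
# Fold the map on purpose: unit (0, 6) gets a weight next to unit (0, 0),
# so a sample near (0, 0) has two closest units that are far apart on the grid.
codebook[(0, 6)] = (0.0, 0.2)

# Hypothetical samples, not data from the study.
samples = [(0.0, 0.0), (1.0, 2.0), (3.0, 6.0), (2.0, 3.5)]

qe = quantization_error(samples, codebook)
te = topographic_error(samples, codebook)
print(f"QE = {qe:.3f}, TE = {te:.3f}")
```

A low QE means the unit weights sit close to the data, while a low TE means neighbouring data points land on neighbouring units. Both are usually read together when comparing candidate map sizes.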


