Article

A Feature Discretization Method Based on Fuzzy Rough Sets for High-Resolution Remote Sensing Big Data Under Linear Spectral Model

Journal

IEEE TRANSACTIONS ON FUZZY SYSTEMS
Volume 30, Issue 5, Pages 1328-1342

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TFUZZ.2021.3058020

Keywords

Remote sensing; Rough sets; Genetic algorithms; Uncertainty; Fuzzy sets; Sociology; Data models; Adaptive genetic algorithm (GA); discretization; fuzzy rough model; MapReduce framework; mixed pixels

Funding

  1. Hainan Provincial Natural Science Foundation of China [2019CXTD400]
  2. National Key Research and Development Program of China [2018YFB1404400]
  3. National Key R&D Program of China [2018YFB0804402]

Discretization is an important data preprocessing technique in data mining, especially in industrial control. However, traditional discretization methods have shortcomings, particularly in the preprocessing of high-resolution remote sensing big data, where necessary information is lost. This study proposes a discretization method for high-resolution remote sensing big data that determines the membership degree of each pixel by linear decomposition, builds the individual fitness function on a fuzzy rough model, and selects discrete breakpoints with an adaptive genetic algorithm. The method obtains the optimal discretization scheme in minimum time by computing the individual fitness of the population in parallel with a MapReduce framework.
As one of the most relevant data preprocessing techniques, discretization plays an important role in data mining and is widely applied in industrial control. It transforms continuous features into discrete ones, which improves the efficiency of data processing and suits learning algorithms that require discrete inputs. However, traditional discretization methods have shortcomings, such as highly complex programs, an excessive number of intervals, and significant loss of necessary information when preprocessing high-resolution remote sensing big data. Moreover, the large number of mixed pixels in an image is a primary source of uncertainty in remote sensing information systems. Current discretization methods assume that one pixel corresponds to the spectral information of a single object and ignore the uncertainty caused by a mixed spectrum, which causes classification accuracy to drop after discretization. We propose a discretization method for high-resolution remote sensing big data. We determine the membership degree of each pixel in the training samples through linear decomposition and establish the individual fitness function based on a fuzzy rough model. An adaptive genetic algorithm selects discrete breakpoints, and a MapReduce framework calculates the individual fitness of the population in parallel to obtain the optimal discretization scheme in minimum time. We compare our method with state-of-the-art discretization algorithms on real remote sensing datasets. The experiments verify the effectiveness of the proposed method, which provides strong support for the subsequent processing of images.
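The pipeline described in the abstract can be illustrated with a small toy sketch. It is not the authors' implementation: the abstract does not give the fitness formula, so the `fitness` function below is an assumed stand-in (membership-weighted interval purity minus a penalty per breakpoint) rather than the paper's fuzzy rough fitness. The unmixing step assumes a single band with exactly two endmembers, and the built-in `map` and `max` stand in for a real MapReduce job. All function names, class labels, and numbers are hypothetical.

```python
from bisect import bisect_right
from collections import defaultdict


def linear_memberships(value, endmembers):
    """Two-endmember, single-band linear unmixing (assumed simplification).

    Under the linear model value = a * e_a + (1 - a) * e_b, the abundance
    a is read as the pixel's membership degree in class A.
    """
    (cls_a, e_a), (cls_b, e_b) = endmembers
    a = (value - e_b) / (e_a - e_b)
    a = min(1.0, max(0.0, a))  # clip to the valid range [0, 1]
    return {cls_a: a, cls_b: 1.0 - a}


def discretize(value, breakpoints):
    """Return the index of the interval that value falls into."""
    return bisect_right(breakpoints, value)


def fitness(breakpoints, samples, penalty=0.01):
    """Toy fitness (an assumption, not the paper's fuzzy rough measure).

    Rewards breakpoint sets whose intervals are dominated by one class,
    weighted by membership degree, and penalizes each extra breakpoint.
    """
    cuts = sorted(breakpoints)
    totals = defaultdict(lambda: defaultdict(float))
    for value, memberships in samples:
        interval = discretize(value, cuts)
        for cls, degree in memberships.items():
            totals[interval][cls] += degree
    dominant = sum(max(per_class.values()) for per_class in totals.values())
    mass = sum(sum(per_class.values()) for per_class in totals.values())
    return dominant / mass - penalty * len(cuts)


# Hypothetical endmember reflectances and pixel values for one band.
endmembers = (("water", 0.1), ("soil", 0.9))
pixels = [0.1, 0.3, 0.7, 0.9]
samples = [(p, linear_memberships(p, endmembers)) for p in pixels]

# A population of candidate breakpoint sets, as a genetic algorithm would hold.
population = [[], [0.5], [0.2, 0.5, 0.8]]

# "Map": score each individual independently; "reduce": keep the best one.
scores = list(map(lambda cuts: fitness(cuts, samples), population))
best_score, best_cuts = max(zip(scores, population), key=lambda pair: pair[0])
```

Because each individual's score depends only on its own breakpoints and the shared samples, the scoring step is independent per individual, which is the property that lets the fitness evaluation be distributed across workers.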
