Article

Effect of Training Class Label Noise on Classification Performances for Land Cover Mapping with Satellite Image Time Series

Journal

REMOTE SENSING
Volume 9, Issue 2

Publisher

MDPI
DOI: 10.3390/rs9020173

Keywords

class label noise; mislabeled training data; satellite image time series; classification; land cover mapping; Support Vector Machines; Random Forests

Funding

  1. French spatial agency (CNES)
  2. French mapping agency (IGN)
  3. University Paul Sabatier
  4. CNRS (Centre National de la Recherche Scientifique)
  5. CNES
  6. IRD (Institut de Recherche pour le Développement)
  7. MATIS laboratory - IGN

Supervised classification systems used for land cover mapping require accurate reference databases. These reference data generally come from different sources such as field measurements, thematic maps, or aerial photographs. Due to misregistration, update delays, or land cover complexity, they may contain class label noise, i.e., wrong label assignments. This study evaluates the impact of mislabeled training data on classification performance for land cover mapping. In particular, it addresses random and systematic label noise in the classification of high-resolution satellite image time series. Experiments are carried out on synthetic and real datasets with two traditional classifiers: Support Vector Machines (SVM) and Random Forests (RF). A synthetic dataset, simulating vegetation profiles over one year, was designed for this study. The real dataset is composed of Landsat-8 and SPOT-4 images acquired over one year in the south of France. The results show that both classifiers are only slightly affected by random noise levels up to 25%-30%, but their performance drops at higher noise levels. Different classification configurations are tested by increasing the number of classes, using different input feature vectors, and changing the number of training instances. Algorithm complexities are also analyzed. The RF classifier achieves high robustness to random and systematic label noise in all the tested configurations, whereas the SVM classifier is more sensitive to the kernel choice and to the input feature vectors. Finally, this work reveals that the cross-validation procedure is affected by the presence of class label noise.
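
The abstract distinguishes random from systematic label noise but does not spell out how either is injected. As a rough illustration only, the sketch below assumes that random noise flips a chosen fraction of training labels to any other class with equal probability, and that systematic noise flips a fraction of one class to a single fixed class it is easily confused with. The function names, the class names (wheat, maize, forest) and the 20% rate are hypothetical and are not taken from the paper.

```python
import random


def inject_random_label_noise(labels, noise_rate, classes, seed=0):
    """Copy labels and flip a fraction noise_rate of them to a different,
    uniformly chosen class (assumed model of random label noise)."""
    rng = random.Random(seed)
    noisy = list(labels)
    n_flip = round(noise_rate * len(noisy))
    # Pick distinct positions so each selected label is flipped exactly once.
    for i in rng.sample(range(len(noisy)), n_flip):
        alternatives = [c for c in classes if c != noisy[i]]
        noisy[i] = rng.choice(alternatives)
    return noisy


def inject_systematic_label_noise(labels, noise_rate, confusion, seed=0):
    """Copy labels and, for each source class in confusion, flip a fraction
    noise_rate of its instances to one fixed target class
    (assumed model of systematic label noise)."""
    rng = random.Random(seed)
    noisy = list(labels)
    for source, target in confusion.items():
        indices = [i for i, y in enumerate(labels) if y == source]
        n_flip = round(noise_rate * len(indices))
        for i in rng.sample(indices, n_flip):
            noisy[i] = target
    return noisy


# Hypothetical training labels: 50 instances for each of three classes.
classes = ["wheat", "maize", "forest"]
clean = ["wheat"] * 50 + ["maize"] * 50 + ["forest"] * 50

random_noisy = inject_random_label_noise(clean, 0.2, classes)
systematic_noisy = inject_systematic_label_noise(clean, 0.2, {"wheat": "maize"})

n_changed = sum(a != b for a, b in zip(clean, random_noisy))
print(n_changed, systematic_noisy.count("wheat"), systematic_noisy.count("maize"))
```

Training a classifier on the noisy labels and scoring it against clean test labels, for increasing values of the noise rate, would then give the kind of robustness curve the abstract summarizes.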
