Journal
REMOTE SENSING
Volume 9, Issue 2, Pages -Publisher
MDPI
DOI: 10.3390/rs9020173
Keywords
class label noise; mislabeled training data; satellite image time series; classification; land cover mapping; Support Vector Machines; Random Forests
Categories
Funding
- French spatial agency (CNES)
- French mapping agency (IGN)
- University Paul Sabatier
- CNRS (Centre National de la Recherche Scientifique)
- CNES
- IRD (Institut de Recherche pour le Developpement)
- MATIS laboratory - IGN
Ask authors/readers for more resources
Supervised classification systems used for land cover mapping require accurate reference databases. These reference data come generally from different sources such as field measurements, thematic maps, or aerial photographs. Due to misregistration, update delay, or land cover complexity, they may contain class label noise, i.e., a wrong label assignment. This study aims at evaluating the impact of mislabeled training data on classification performances for land cover mapping. Particularly, it addresses the random and systematic label noise problem for the classification of high resolution satellite image time series. Experiments are carried out on synthetic and real datasets with two traditional classifiers: Support Vector Machines (SVM) and Random Forests (RF). A synthetic dataset has been designed for this study, simulating vegetation profiles over one year. The real dataset is composed of Landsat-8 and SPOT-4 images acquired during one year in the south of France. The results show that both classifiers are little influenced for low random noise levels up to 25%-30%, but their performances drop down for higher noise levels. Different classification configurations are tested by increasing the number of classes, using different input feature vectors, and changing the number of training instances. Algorithm complexities are also analyzed. The RF classifier achieves high robustness to random and systematic label noise for all the tested configurations; whereas the SVM classifier is more sensitive to the kernel choice and to the input feature vectors. Finally, this work reveals that the cross-validation procedure is impacted by the presence of class label noise.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available