Article

On the Generalization Ability of Data-Driven Models in the Problem of Total Cloud Cover Retrieval

REMOTE SENSING (2021)

Journal

REMOTE SENSING

Volume 13, Issue 2, Pages -

Publisher

MDPI

DOI: 10.3390/rs13020326

Keywords

total cloud cover; all-sky camera; algorithms assessment; neural networks; machine learning; data-driven approach

Categories

Environmental Sciences; Geosciences, Multidisciplinary; Remote Sensing; Imaging Science & Photographic Technology

Funding

Russian Ministry of Science and Higher Education [No. 05.616.21.0112, RFMEFI61619X0112]

Summary

This study discusses the optimization of data-driven schemes for Total Cloud Cover (TCC) retrieval from ground-based optical imagery and proposes new algorithms based on deep learning techniques. The algorithms are evaluated on a dataset containing over one million all-sky optical images from various ocean regions, and the evaluation demonstrates the superiority of convolutional neural networks over previously published approaches.

Abstract

Total Cloud Cover (TCC) retrieval from ground-based optical imagery is a problem that has been tackled by several generations of researchers. The number of human-designed algorithms for the estimation of TCC grows every year. However, there has been no considerable progress in terms of quality, mostly due to the lack of a systematic approach to the design of the algorithms, to the assessment of their generalization ability, and to the assessment of the TCC retrieval quality. In this study, we discuss the optimization nature of data-driven schemes for TCC retrieval. In order to compare the algorithms, we propose a framework for the assessment of the algorithms' characteristics. We present several new algorithms that are based on deep learning techniques: a model for outlier filtering and a few models for TCC retrieval from all-sky imagery. For training and assessment of the data-driven algorithms of this study, we present the Dataset of All-Sky Imagery over the Ocean (DASIO), containing over one million all-sky optical images of the visible sky dome taken in various regions of the world ocean. The research campaigns that contributed to the DASIO collection took place in the Atlantic Ocean, the Indian Ocean, the Red and Mediterranean Seas, and the Arctic Ocean. Optical imagery collected during these missions is accompanied by standard meteorological observations of cloudiness characteristics made by experienced observers. We assess the generalization ability of the presented models in several scenarios that differ in terms of the regions selected for the train and test subsets. As a result, we demonstrate that our models based on convolutional neural networks deliver superior quality compared to all previously published approaches. As a key result, we demonstrate a considerable drop in the ability to generalize from the training data in the case of a strong covariate shift between the training and test subsets of imagery, which may occur in the case of region-aware subsampling.
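
The difference between a random split and a region-aware split, as contrasted in the abstract, can be illustrated with a minimal sketch. This is not the authors' code: the record fields (`image_id`, `region`, `tcc_okta`), the function names, and the toy data are all hypothetical, and the use of oktas as the TCC label is an assumption that the abstract does not confirm. The point is only that holding out whole regions guarantees that no test region is seen during training, which is the setting where a covariate shift may appear.

```python
import random


def random_split(samples, test_fraction=0.25, seed=0):
    """Shuffle all samples and cut once; any region may appear in both subsets."""
    rng = random.Random(seed)
    shuffled = list(samples)
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def region_aware_split(samples, test_regions):
    """Hold out whole regions, so the test regions never occur in training."""
    train = [s for s in samples if s["region"] not in test_regions]
    test = [s for s in samples if s["region"] in test_regions]
    return train, test


# Hypothetical toy records (assumption: TCC label stored in oktas, 0..8).
regions = ["Atlantic", "Indian", "Arctic", "Mediterranean"]
samples = [
    {"image_id": i, "region": regions[i % len(regions)], "tcc_okta": i % 9}
    for i in range(20)
]

rand_train, rand_test = random_split(samples)
reg_train, reg_test = region_aware_split(samples, test_regions={"Arctic"})

# Regions shared between train and test under each scheme.
rand_overlap = {s["region"] for s in rand_train} & {s["region"] for s in rand_test}
reg_overlap = {s["region"] for s in reg_train} & {s["region"] for s in reg_test}
```

With the region-aware scheme `reg_overlap` is empty by construction, whereas a random split will usually place images from the same region in both subsets, so test performance under it says less about generalization to unseen regions.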
