4.1 Article

Reconstruction of incomplete wildfire data using deep generative models

Journal

EXTREMES
Volume 26, Issue 2, Pages 251-271

Publisher

SPRINGER
DOI: 10.1007/s10687-022-00459-1

Keywords

Data reconstruction; Variational autoencoder; Convolutional neural network; Deep learning; Ensemble; Extreme Value Analysis Conference challenge; Wildfires

Ask authors/readers for more resources

We submitted our solution to the Extreme Value Analysis 2021 Data Challenge, aiming to accurately predict distributions of wildfire frequency and size in missing spatio-temporal regions. To achieve this, we developed a variant of variational autoencoder models called Conditional Missing data Importance-Weighted Autoencoder (CMIWAE). Our deep latent variable generative model requires minimal feature engineering and is not tied to the specific scoring in the Data Challenge. It is trained on incomplete data, with the sole objective of maximizing the log-likelihood of observed wildfire information. We alleviate the impact of the relatively low number of training samples by employing stochastic sampling from a variational latent variable distribution and ensembling CMIWAE models trained and validated on different data splits.
We present our submission to the Extreme Value Analysis 2021 Data Challenge in which teams were asked to accurately predict distributions of wildfire frequency and size within spatio-temporal regions of missing data. For this competition, we developed a variant of the powerful variational autoencoder models, which we call Conditional Missing data Importance-Weighted Autoencoder (CMIWAE). Our deep latent variable generative model requires little to no feature engineering and does not necessarily rely on the specifics of scoring in the Data Challenge. It is fully trained on incomplete data, with the single objective to maximize log-likelihood of the observed wildfire information. We mitigate the effects of the relatively low number of training samples by stochastic sampling from a variational latent variable distribution, as well as by ensembling a set of CMIWAE models trained and validated on different splits of the provided data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.1
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available