4.7 Article

Estimation of missing values in heterogeneous traffic data: Application of multimodal deep learning model

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 194, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2020.105592

Keywords

Autoencoder; Feature fusion; Deep learning; Traffic missing data; Imputation

Funding

  1. Shenzhen Science and Technology program, China [KQTD20180412181337494]
  2. National Natural Science Foundation of China [51822802, 51778033, U1811463]

Ask authors/readers for more resources

With the development of sensing technology, a large amount of heterogeneous traffic data can be collected. However, the raw data often contain corrupted or missing values, which need to be imputed to aid traffic condition monitoring and the assessment of the system performance. Several existing studies have reported imputation models used to impute the missing values, and most of these models aimed to capture the spatial or temporal dependencies. However, the dependencies of the heterogeneous data were ignored. To this end, we propose a multimodal deep learning model to enable heterogeneous traffic data imputation. The model involves the use of two parallel stacked autoencoders that can simultaneously consider the spatial and temporal dependencies. In addition, a latent feature fusion layer is developed to capture the dependencies of the heterogeneous traffic data. To train the proposed imputation model, a hierarchical training method is introduced. Using a real world dataset, the performance of the proposed model is evaluated and compared with that of several widely used temporal imputation models, spatial imputation models, and spatial-temporal imputation models. The experimental and evaluation results indicate that the values of the evaluation criteria of the proposed model are smaller, indicating a better performance. The results also show that the proposed model can accurately impute the continuously missing data. Furthermore, the sensitivity of the parameters used in the proposed deep multimodal deep learning model is investigated. This study clearly demonstrates the effectiveness of deep learning for heterogeneous traffic data synthesis and missing data imputation. The dependencies of the heterogeneous traffic data should be considered in future studies to improve the performance of the imputation model. (C) 2020 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available