4.7 Article

A hierarchical learning model for inferring the labels of points of interest with unbalanced data distribution

出版社

ELSEVIER
DOI: 10.1016/j.jag.2022.102751

关键词

Point of interest; Data label; Imbalanced learning; Data quality; Spatial inference

资金

  1. National Natural Science Foundation of China [42071442]
  2. Fundamental Research Funds for the Central Universities, China Uni-versity of Geosciences (Wuhan) [CUG170640]

向作者/读者索取更多资源

This paper proposes a neural network prediction method based on multi-level POI category organization, which improves prediction performance by aggregating small sample categories hierarchically while ensuring a balanced data volume.
The point of interest (POI) is a critical part of spatial database, which has been widely used in many fields, such as navigation, mapping and urban planning. The data quality of POIs (e.g., missing and incorrect labels) has a large impact on the effectiveness of geospatial applications, especially regarding the non-professional collection characteristics of OpenStreetMap (OSM) data. The conventional neural network model predicted multi-category data labels directly from a single level, and did not consider the uneven distribution of data among POI categories. The predicted labels tended to be the types with larger data volume, and it is difficult to generalize the small-scale samples. Taking into account the large difference in the data volume among POI categories, this paper proposes a neural network prediction method based on multi-level POI category organization. Through the hierarchical aggregation of small sample categories, we established a POI category tree structure, which achieved a relatively balanced division of data volume at different levels of the tree structure. Specifically, the proposed method first roughly classified the POIs at an abstraction level and then inferred the detailed labels according to the hierarchy of the tree structure. We conducted extensive experiments on two datasets and the results demonstrated that our method outperformed traditional methods by a large margin.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据