☆ 4.4 Article

CMLocate: A cross-modal automatic visual geo-localization framework for a natural environment without GNSS information

IET IMAGE PROCESSING (2023)

期刊

IET IMAGE PROCESSING

卷 -, 期 -, 页码 -

出版社

WILEY

DOI: 10.1049/ipr2.12883

关键词

computer vision; image processing; image retrieval; image segmentation

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic Imaging Science & Photographic Technology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

In this paper, a new approach to visual geo-localization for natural environments is proposed by creating a panoramic skyline database using digital elevation model (DEM) data in virtual space. The combination of the skyline database and real-world image data enables visual geo-localization as a cross-modal image retrieval problem. The paper introduces the LineNet semantic segmentation model for skyline extractions, which has proven to be robust in complex natural environments. Additionally, a compound index is designed to reduce storage space and improve retrieval efficiency, resulting in the proposed method outperforming most state-of-the-art methods.

In this paper, a new approach to visual geo-localization for natural environments is proposed. The digital elevation model (DEM) data in virtual space is rendered and construct a panoramic skyline database is constructed. By combining the skyline database with real-world image data (used as the queries to be localized), visual geo-localization is treated as a cross-modal image retrieval problem for panoramic skyline images, creating a unique new visual geo-localization benchmark for the natural environment. Specifically, the semantic segmentation model named LineNet is proposed, for skyline extractions from query images, which has proven to be robust to a variety of complex natural environments. On the aforementioned benchmarks, the fully automatic method is elaborated for large-scale cross-modal localization using panoramic skyline images. Finally, the compound index is delicately designed to reduce the storage space of the positioning global descriptors and improve the retrieval efficiency. Moreover, the proposed method is proven to outperform most state-of-the-art methods.

CMLocate: A cross-modal automatic visual geo-localization framework for a natural environment without GNSS information

期刊

IET IMAGE PROCESSING

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

CMLocate: A cross-modal automatic visual geo-localization framework for a natural environment without GNSS information

期刊

IET IMAGE PROCESSING

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文