4.6 Article

A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-Net

期刊

SENSORS
卷 21, 期 8, 页码 -

出版社

MDPI
DOI: 10.3390/s21082803

关键词

power lines; semantic segmentation; Matthews correlation coefficient; loss function; data imbalance

资金

  1. Department of Computer and Information Sciences and High-Performance Cloud Computing Centre (HPC3), Universiti Teknologi PETRONAS
  2. Kyushu Institute of Technology, Japan
  3. Universiti Teknologi PETRONAS, Malaysia

向作者/读者索取更多资源

This study proposes a generalized focal loss function based on the Matthews correlation coefficient or the Phi coefficient to address the class imbalance problem in power line segmentation using a generic deep segmentation architecture. Evaluation on two power line datasets shows that the proposed loss function outperforms the popular BBCE loss in dice scores, precision, and false detection rate values. Additionally, the ACU-Net model improves upon the baseline U-Net for evaluation parameters in the range of 1-10% for both datasets, achieving an optimal trade-off without any additional complexity.
The segmentation of power lines (PLs) from aerial images is a crucial task for the safe navigation of unmanned aerial vehicles (UAVs) operating at low altitudes. Despite the advances in deep learning-based approaches for PL segmentation, these models are still vulnerable to the class imbalance present in the data. The PLs occupy only a minimal portion (1-5%) of the aerial images as compared to the background region (95-99%). Generally, this class imbalance problem is addressed via the use of PL-specific detectors in conjunction with the popular class balanced cross entropy (BBCE) loss function. However, these PL-specific detectors do not work outside their application areas and a BBCE loss requires hyperparameter tuning for class-wise weights, which is not trivial. Moreover, the BBCE loss results in low dice scores and precision values and thus, fails to achieve an optimal trade-off between dice scores, model accuracy, and precision-recall values. In this work, we propose a generalized focal loss function based on the Matthews correlation coefficient (MCC) or the Phi coefficient to address the class imbalance problem in PL segmentation while utilizing a generic deep segmentation architecture. We evaluate our loss function by improving the vanilla U-Net model with an additional convolutional auxiliary classifier head (ACU-Net) for better learning and faster model convergence. The evaluation of two PL datasets, namely the Mendeley Power Line Dataset and the Power Line Dataset of Urban Scenes (PLDU), where PLs occupy around 1% and 2% of the aerial images area, respectively, reveal that our proposed loss function outperforms the popular BBCE loss by 16% in PL dice scores on both the datasets, 19% in precision and false detection rate (FDR) values for the Mendeley PL dataset and 15% in precision and FDR values for the PLDU with a minor degradation in the accuracy and recall values. Moreover, our proposed ACU-Net outperforms the baseline vanilla U-Net for the characteristic evaluation parameters in the range of 1-10% for both the PL datasets. Thus, our proposed loss function with ACU-Net achieves an optimal trade-off for the characteristic evaluation parameters without any bells and whistles. Our code is available at Github.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据