4.6 Article

Real-time semantic segmentation via sequential knowledge distillation

Journal

NEUROCOMPUTING
Volume 439, Issue -, Pages 134-145

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2021.01.086

Keywords

Semantic segmentation; Spatial refinement constraint; Adversarial network; Knowledge distillation

Funding

  1. National Natural Science Foundation of China [U1705262, 61772443, 61572410, 61802324, 61702136]
  2. National Key RD Program [2017YFC0113000, 2016YFB1001503]
  3. Key R&D Program of Jiangxi Province [20171ACH80022]
  4. Natural Science Foundation of Guangdong Province in China [2019B1515120049]

Ask authors/readers for more resources

This paper introduces a novel Sequential Prediction Network (SPNet) with Spatial Semantic and Edge Loss (SEL) and an adversarial network to achieve high segmentation accuracy in real-time applications. By utilizing a knowledge distillation scheme, the method effectively compresses structured knowledge from cumbersome networks, achieving promising results.
Deep model-based semantic segmentation has received ever increasing research focus in recent years. However, due to the complex model architectures, existing works are still unable to achieve high accuracy in real-time applications. In this paper, we propose a novel Sequential Prediction Network (termed SPNet) to seek a better trade-off between accuracy and efficiency. SPNet is also an end-to-end encoder-decoder architecture, which introduces a sequential prediction method to spread the contextual information from the low-level layers to the high-level layers. Besides, the proposed method is equipped with a stream Spatial Semantic and Edge Loss (termed SEL) and an adversarial network at multiple resolutions, which greatly improves the segmentation accuracy with a negligible increase in computation cost. To further uti-lize the extra unlabeled data, we present a knowledge distillation scheme to distill the structured knowl-edge from cumbersome to compact networks. Without using any pre-trained model, our method achieves state-of-the-art performance among exiting real-time segmentation models on several challenging data-sets. Impressively, on the Cityscapes test dataset, it obtains 75.8% mIoU at a speed of 61.2 FPS. (c) 2021 Published by Elsevier B.V.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available