4.7 Article

Deformable Kernel Networks for Joint Image Filtering

Journal

INTERNATIONAL JOURNAL OF COMPUTER VISION
Volume 129, Issue 2, Pages 579-600

Publisher

SPRINGER
DOI: 10.1007/s11263-020-01386-z

Keywords

Joint filtering; Convolutional neural networks; Depth map upsampling; Cross-modality image restoration; Texture removal; Semantic segmentation

Funding

  1. Samsung Research Funding & Incubation Center for Future Technology [SRFC-IT1802-06]
  2. Louis Vuitton/ENS chair on artificial intelligence
  3. Inria/NYU collaboration agreement
  4. French government under Agence Nationale de la Recherche as part of the Investissements d'avenir program [ANR-19-P3IA-0001]

Ask authors/readers for more resources

In this paper, a new CNN architecture and its efficient implementation, called the deformable kernel network (DKN), is proposed for image filtering. By adaptively outputting sets of neighboring pixels and corresponding weights for each pixel, the filtering result is computed as a weighted average, demonstrating superior performance in various tasks.
Joint image filters are used to transfer structural details from a guidance picture used as a prior to a target image, in tasks such as enhancing spatial resolution and suppressing noise. Previous methods based on convolutional neural networks (CNNs) combine nonlinear activations of spatially-invariant kernels to estimate structural details and regress the filtering result. In this paper, we instead learn explicitly sparse and spatially-variant kernels. We propose a CNN architecture and its efficient implementation, called the deformable kernel network (DKN), that outputs sets of neighbors and the corresponding weights adaptively for each pixel. The filtering result is then computed as a weighted average. We also propose a fast version of DKN that runs about seventeen times faster for an image of size 640x480. We demonstrate the effectiveness and flexibility of our models on the tasks of depth map upsampling, saliency map upsampling, cross-modality image restoration, texture removal, and semantic segmentation. In particular, we show that the weighted averaging process with sparsely sampled 3 x 3 kernels outperforms the state of the art by a significant margin in all cases.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available