4.4 Article

Image-level dataset synthesis with an end-to-end trainable framework

期刊

IET IMAGE PROCESSING
卷 16, 期 8, 页码 2228-2239

出版社

WILEY
DOI: 10.1049/ipr2.12486

关键词

-

资金

  1. National Natural Science Foundation of China [61633019]

向作者/读者索取更多资源

This paper proposes a method to simulate synthetic datasets from a fine-grained perspective, enabling the system to be trained in an end-to-end manner. By converting it into an image-level data synthesis problem and training a generative model to approximate the rendering process, the whole framework becomes fully differentiable, allowing efficient optimization of attributes.
Dataset synthesis via virtual engines like Unity is attracting much more attention in recent years due to its low cost at obtaining ground-truth labels. For this kind of work, virtual environments are constructed within the engine to mimic the real-world, either with great manual efforts or learning-based methods. The latter shows superiority over the former when the target real-world scenes are changeable, from which the attributes of environments can be automatically adjusted based on the distribution difference between the synthetic and real-world datasets. However, the non-differentiability of whole pipeline hinders the efficiency of attribute optimization. To this end, this paper proposes to simulate synthetic datasets from a fine-grained perspective, such that the system can be trained at an end-to-end manner. Specifically, it is converted into an image-level data synthesis problem, and designs a constraint using the content loss between two images. As the rendering process of virtual engine is mathematically unknown, which blocks the back propagation of the gradients, a generative model is trained to approximate the engine. As a result, the whole framework becomes fully differentiable and the attributes can be optimized efficiently by gradient descent. Experimental result shows the efficiency of our method in obtaining useful synthetic training datasets. Besides, it is found that the image-level method enables to learn the potential distribution of real-world data, which is hard to be achieved by existing methods. As far as we know, it is the first attempt to finish this task with a differentiable process.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据