4.5 Article

Disturbed-entropy: A simple data quality assessment approach

期刊

ICT EXPRESS
卷 8, 期 3, 页码 309-312

出版社

ELSEVIER
DOI: 10.1016/j.icte.2022.01.006

关键词

Neural network; Data-centric computing; Information entropy

资金

  1. National Natural Science Foundation of China [32101612]
  2. Major Science and Technology Projects of Xinjiang Production and Construction Corps [2021AA006]

向作者/读者索取更多资源

From the perspective of information value, a simple and effective approach called disturbed-entropy is proposed to assess data quality. The existing samples per category in image classification task are statistically represented as a pixel prototype, which is used to disturb the unseen samples. Then, the entropy of disturbed image is calculated based on predicted probability. Numerical and visual experiments are conducted to show the effectiveness of the approach.
From the perspective of information value, we proposed a simple and effective approach to assess data quality, called disturbed-entropy. In specific, considering image classification task, the existing samples per category are statistically represented as a pixel prototype, which is used to disturb the unseen samples. Then, the entropy of disturbed image is calculated based on predicted probability. Both the numerical and visual experiments are conducted to show the effect. In case of same data budget, the performance comparison based on selected good and bad data is significant and consistent. This work attempts to gain insight into data quality and redundancy. (C) 2022 The Author(s). Published by Elsevier B.V. on behalf of The Korean Institute of Communications and Information Sciences.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据