期刊
ICT EXPRESS
卷 8, 期 3, 页码 309-312出版社
ELSEVIER
DOI: 10.1016/j.icte.2022.01.006
关键词
Neural network; Data-centric computing; Information entropy
资金
- National Natural Science Foundation of China [32101612]
- Major Science and Technology Projects of Xinjiang Production and Construction Corps [2021AA006]
From the perspective of information value, a simple and effective approach called disturbed-entropy is proposed to assess data quality. The existing samples per category in image classification task are statistically represented as a pixel prototype, which is used to disturb the unseen samples. Then, the entropy of disturbed image is calculated based on predicted probability. Numerical and visual experiments are conducted to show the effectiveness of the approach.
From the perspective of information value, we proposed a simple and effective approach to assess data quality, called disturbed-entropy. In specific, considering image classification task, the existing samples per category are statistically represented as a pixel prototype, which is used to disturb the unseen samples. Then, the entropy of disturbed image is calculated based on predicted probability. Both the numerical and visual experiments are conducted to show the effect. In case of same data budget, the performance comparison based on selected good and bad data is significant and consistent. This work attempts to gain insight into data quality and redundancy. (C) 2022 The Author(s). Published by Elsevier B.V. on behalf of The Korean Institute of Communications and Information Sciences.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据