Journal
SENSORS
Volume 21, Issue 5, Pages -
Publisher
MDPI
DOI: 10.3390/s21051573
Keywords
audio sound classification; image classification; clustering; prototype selection; siamese network; dissimilarity space
Funding
- NVIDIA Corporation
The image classification system proposed in this study uses Siamese Neural Networks to generate dissimilarity spaces, calculates centroids with k-means clustering, and classifies images using SVMs. The system performs competitively on medical and animal audio data sets, reaching state-of-the-art performance on one of the medical data sets without ad-hoc optimization of the clustering methods on the tested data sets.
Traditionally, classifiers are trained to predict patterns within a feature space. The image classification system presented here trains classifiers to predict patterns within a vector space by combining the dissimilarity spaces generated by a large set of Siamese Neural Networks (SNNs). A set of centroids is calculated from the patterns in the training data sets with supervised k-means clustering. The centroids are used to generate the dissimilarity space via the Siamese networks. The vector space descriptors are extracted by projecting patterns onto the dissimilarity spaces, and SVMs classify an image by its dissimilarity vector. The versatility of the proposed approach is demonstrated by evaluating the system on different types of images across two domains: two medical data sets and two animal audio data sets with vocalizations represented as images (spectrograms). Results show that the proposed system competes with the best-performing methods in the literature, obtaining state-of-the-art performance on one of the medical data sets, and does so without ad-hoc optimization of the clustering methods on the tested data sets.
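The projection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: plain Euclidean distance stands in for the trained Siamese networks, "supervised k-means" is read as running k-means separately within each class, and the names `kmeans`, `class_wise_centroids`, and `dissimilarity_vector` are hypothetical. Only one dissimilarity space is built here, whereas the paper combines many.

```python
import math
import random


def euclidean(a, b):
    # Stand-in dissimilarity (assumption); the paper uses trained Siamese networks.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def kmeans(points, k, iters=20, seed=0):
    # Plain Lloyd's k-means on lists of floats.
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: euclidean(p, centroids[i]))
            clusters[nearest].append(p)
        # Recompute each centroid as the cluster mean; keep the old one if empty.
        centroids = [
            [sum(col) / len(c) for col in zip(*c)] if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids


def class_wise_centroids(X, y, k_per_class):
    # Assumed reading of "supervised k-means": cluster each class on its own.
    centroids = []
    for label in sorted(set(y)):
        members = [x for x, lab in zip(X, y) if lab == label]
        centroids.extend(kmeans(members, min(k_per_class, len(members))))
    return centroids


def dissimilarity_vector(x, centroids, dissimilarity=euclidean):
    # Project a pattern onto the dissimilarity space: one coordinate per centroid.
    return [dissimilarity(x, c) for c in centroids]


# Toy data: two well-separated classes in 2-D (illustrative only).
X = [[0.0, 0.0], [0.1, 0.2], [0.2, 0.1], [5.0, 5.0], [5.1, 4.9], [4.9, 5.2]]
y = [0, 0, 0, 1, 1, 1]

centroids = class_wise_centroids(X, y, k_per_class=2)
vec = dissimilarity_vector([0.1, 0.1], centroids)
```

The resulting vectors would then serve as input features for a classifier; the paper uses SVMs for that step, which this sketch leaves out.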