Journal
INTELLIGENT DATA ANALYSIS
Volume 20, Issue 1, Pages 29-45Publisher
IOS PRESS
DOI: 10.3233/IDA-150792
Keywords
Mixed attributes; high-dimensional data; cluster boundary; shadowed set
Categories
Funding
- Science and Technology Research Key Project of Henan Province of China [9412013Y1486]
Ask authors/readers for more resources
To meet the need of extracting cluster boundary from mixed attribute data in the field of data analysis, we propose a cluster boundary detection algorithm for mixed attribute data sets in this research, named CHASM(Cluster Boundary Detection Algorithm based on Shadowed Set). Based on the structure of clusters, the CHASM defines a new objective function according to the data set which is categorized into three collections, i. e. core, exclusion and shadow. Then CHASM updates the centroid information of clusters based on the variance of contribution degree among these collections to the clusters centroids. Finally, in an iterative optimization process, the CHASM can extract its shadow sets from each cluster to form the boundary of clusters. The experimental results, on both the synthetic data and real data with mixed attributes, numerical attributes and categorical attributes, show that CHASM can effectively detect cluster boundary with higher or similar accuracy to its rival methods. Furthermore, the CHASM can eliminate noise effectively.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available