4.7 Article

Information clustering based on fuzzy multisets

Journal

INFORMATION PROCESSING & MANAGEMENT
Volume 39, Issue 2, Pages 195-213

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/S0306-4573(02)00047-X

Keywords

information retrieval; data clustering; fuzzy multiset; cluster center; algorithm

Ask authors/readers for more resources

A fuzzy multiset model for information clustering is proposed with application to information retrieval on the World Wide Web. Noting that a search engine retrieves multiple occurrences of the same subjects with possibly different degrees of relevance, we observe that fuzzy multisets provide an appropriate model of information retrieval on the WWW. Information clustering which means both term clustering and document clustering is considered. Three methods of the hard c-means, fuzzy c-means, and an agglomerative method using cluster centers are proposed. Two distances between fuzzy multisets and algorithms for calculating cluster centers are defined. Theoretical properties concerning the clustering algorithms are studied. Illustrative examples are given to show how the algorithms work. (C) 2002 Elsevier Science Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available