4.7 Article

Multi-objective multi-view based search result clustering using differential evolution framework

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 168, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

Multi-view clustering; Differential evolution; Multi-objective optimization; Textual entailment; Word mover distance; Universal sentence encoder

资金

  1. Early Career Research Award of Science and Engineering Research Board (SERB) of Department of Science and Technology India

向作者/读者索取更多资源

Search Results Clustering problem is addressed as a multi-view clustering problem in this study, where optimization and new views are introduced to improve the clustering process. The proposed approach shows superior performance compared to state-of-the-art techniques.
Search Results Clustering (SRC) is a well-known problem in the field of information retrieval and refers to the clustering of web-snippets for a given query based on some similarity/dissimilarity measure. In this current study, we have posed Search Results Clustering problem as a multi-view clustering problem and solved it from an optimization point of view. Various views based on syntactic and semantic similarity measures were considered while performing the clustering. In contrast to existing algorithms, three new views based on word mover distance, textual-entailment, and universal sentence encoder, measuring semantics while performing clustering, are incorporated in our framework. Different quality measures computed on clusters generated by different views are optimized simultaneously using multi-objective binary differential evolution (MBDE) framework. MBDE comprises a set of solutions and each solution is composed of two parts corresponding to different views. An agreement index checking the accordance between partitionings of different views is also optimized to obtain a consensus partitioning. The proposed approach is automatic in nature as it is capable of detecting the number of clusters for any query in an automatic way. Experiments are performed on three benchmark multi-view datasets corresponding to web search results and evaluated using well-known F-measure metric. Results obtained illustrate that our approach outperforms state-of-the-art techniques.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据