Article

A novel Fuzzy-PSO term weighting automatic query expansion approach using combined semantic filtering

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 136, Pages 97-120

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2017.09.004

Keywords

Automatic query expansion; Term weighting schemes; Co-occurrence score; Fuzzy logic; Particle swarm optimization; Term frequency; Inverse document frequency

An Information Retrieval (IR) system retrieves relevant documents from large datasets. Automatic Query Expansion (AQE) is one approach to improving IR performance by adding terms to the original query. Selecting suitable additional terms is a crucial task in AQE, and term weighting is one way to address it. This paper presents a new term-weighting-based AQE approach for retrieving more relevant documents from a data corpus. The proposed approach comprises three major steps. The first step determines the optimal weights of different IR evidences for different terms using Particle Swarm Optimization (PSO). A fuzzy logic technique improves the performance of PSO by controlling the inertia and acceleration coefficients during optimization. A co-occurrence score is introduced as a new IR evidence in the proposed approach. The second step removes noisy terms using a new combined semantic filtering method. The third step reweights the terms using the Rocchio method. The proposed approach is compared with recently developed AQE approaches in terms of precision, recall, F-measure, and Mean Average Precision (MAP). Three benchmark datasets, CACM, CISI, and TREC-3, are used to verify the results. On these benchmark datasets, the proposed approach performs better than the other approaches. (C) 2017 Elsevier B.V. All rights reserved.
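
The third step mentioned in the abstract, Rocchio reweighting, can be illustrated with a minimal Python sketch of the textbook Rocchio formula. This is not the paper's implementation: the abstract does not give the coefficients or the term representation, so the function name `rocchio_reweight`, the dict-based term vectors, and the values alpha=1.0, beta=0.75, gamma=0.15 (common textbook defaults) are all assumptions made for illustration.

```python
from collections import defaultdict


def rocchio_reweight(query, relevant_docs, nonrelevant_docs=(),
                     alpha=1.0, beta=0.75, gamma=0.15):
    """Textbook Rocchio update on sparse term-weight dicts.

    new_q = alpha * q + beta * mean(relevant) - gamma * mean(nonrelevant)
    The coefficient values are assumed defaults, not taken from the paper.
    Terms whose final weight is not positive are dropped.
    """
    new_weights = defaultdict(float)

    # Keep the original query terms, scaled by alpha.
    for term, weight in query.items():
        new_weights[term] += alpha * weight

    # Add the relevant centroid and subtract the non-relevant centroid.
    for docs, coeff in ((relevant_docs, beta), (nonrelevant_docs, -gamma)):
        if not docs:
            continue
        for doc in docs:
            for term, weight in doc.items():
                new_weights[term] += coeff * weight / len(docs)

    return {t: w for t, w in new_weights.items() if w > 0}


# Hypothetical toy vectors, for illustration only.
query = {"query": 1.0, "expansion": 1.0}
relevant = [
    {"query": 0.5, "expansion": 0.5, "retrieval": 1.0},
    {"retrieval": 1.0},
]
nonrelevant = [{"noise": 1.0}]

expanded = rocchio_reweight(query, relevant, nonrelevant)
print(expanded)
```

In this toy run, "retrieval" enters the expanded query because it appears in both assumed relevant documents, while "noise" receives a negative weight and is dropped. In a pseudo-relevance-feedback setting, the top-ranked documents would typically stand in for `relevant_docs`, and `nonrelevant_docs` is often left empty.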
