☆ 4.7 Article

A short text modeling method combining semantic and statistical information

INFORMATION SCIENCES (2010)

Journal

INFORMATION SCIENCES

Volume 180, Issue 20, Pages 4031-4041

Publisher

ELSEVIER SCIENCE INC

DOI: 10.1016/j.ins.2010.06.021

Keywords

Text similarity; Short text similarity; Information retrieval; Query expansion; Text mining; Question answering

Funding

City University of Hong Kong [7002488]
National Grand Fundamental Research 973 Program of China [2003CB317002]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

A novel modeling method for a collection of short text snippets is presented in this paper to measure the similarity between pairs of snippets. The method takes account of both the semantic and statistical information within the short text snippets, and consists of three steps. Given a set of raw short text snippets, it first establishes the initial similarity between words by using a lexical database. The method then iteratively calculates both word similarity and short text similarity. Finally, a proximity matrix is constructed based on word similarity and used to convert the raw text snippets into vectors. Word similarity and text clustering experiments show that the proposed short text modeling method improves the performance of existing text-related information retrieval (IR) techniques. (C) 2010 Elsevier Inc. All rights reserved.

A short text modeling method combining semantic and statistical information

Journal

INFORMATION SCIENCES

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

A short text modeling method combining semantic and statistical information

Journal

INFORMATION SCIENCES

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper