☆ 4.4 Article

Where to go and what to play: Towards summarizing popular information from massive tourism blogs

JOURNAL OF INFORMATION SCIENCE (2015)

Journal

JOURNAL OF INFORMATION SCIENCE

Volume 41, Issue 6, Pages 830-854

Publisher

SAGE PUBLICATIONS LTD

DOI: 10.1177/0165551515603323

Keywords

Blog mining; information retrieval; max-confidence; things of interest; travel sequence

Funding

National Natural Science Foundation of China [71402007 / 71271044 / U1233118 / 71572029]
Fundamental Research Funds for the Central Universities [2014RC0601]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

In this work, we propose a novel method to summarize popular information from massive tourism blog data. First, we crawl blog contents and segment them into semantic word vectors separately. Then, we select the geographical terms in each word vector into a corresponding geographical term vector and present a new method to explore hot tourism locations and, in particular, their frequent sequential relations from a set of geographical term vectors. Third, we propose a novel word vector subdividing method to collect local features for each hot location, and introduce the metric of max-confidence to identify the Things of Interest (ToI) associated with the location from the collected data. We illustrate the benefits of this approach by applying it to a Chinese online tourism blog dataset. The experimental results show that the proposed method can be used to explore hot locations, as well as their sequential relations and corresponding ToI, efficiently.

Where to go and what to play: Towards summarizing popular information from massive tourism blogs

Journal

JOURNAL OF INFORMATION SCIENCE

Publisher

SAGE PUBLICATIONS LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Where to go and what to play: Towards summarizing popular information from massive tourism blogs

Journal

JOURNAL OF INFORMATION SCIENCE

Publisher

SAGE PUBLICATIONS LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper