4.3 Article

A weighted multi-attribute method for matching user-generated Points of Interest

Journal

CARTOGRAPHY AND GEOGRAPHIC INFORMATION SCIENCE
Volume 41, Issue 2, Pages 125-137

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1080/15230406.2014.880327

Keywords

Points of Interest; matching; Linked Data; volunteered geographic information

Categories

Ask authors/readers for more resources

To a large degree, the attraction of Big Data lies in the variety of its heterogeneous multi-thematic and multi-dimensional data sources and not merely its volume. To fully exploit this variety, however, requires conflation. This is a two-step process. First, one has to establish identity relations between information entities across different data sources; and second, attribute values have to be merged according to certain procedures that avoid logical contradictions. The first step, also called matching, can be thought of as a weighted combination of common attributes according to some similarity measures. In this work, we propose such a matching based on multiple attributes of Points of Interest (POI) from the Location-based Social Network Foursquare and the local directory service Yelp. While both contain overlapping attributes that can be used for matching, they have specific strengths and weaknesses that make their conflation desirable. For instance, Foursquare offers information about user check-ins to places, while Yelp specializes in user-contributed reviews. We present a weighted multi-attribute matching strategy, evaluate its performance, and discuss application areas that benefit from a successful matching. Finally, we also outline how the established POI matches can be stored as Linked Data on the Semantic Web. Our strategy can automatically match 97% of randomly selected Yelp POI to their corresponding Foursquare entities.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available