Article

Lessons and challenges from mining retail e-commerce data

Journal

MACHINE LEARNING
Volume 57, Issue 1-2, Pages 83-113

Publisher

SPRINGER
DOI: 10.1023/B:MACH.0000035473.11134.83

Keywords

data mining; data analysis; business intelligence; web analytics; web mining; OLAP; visualization; reporting; data transformations; retail; e-commerce; Simpson's paradox; sessionization; bot detection; clickstreams; application server; web logs; data cleansing; hierarchical attributes; business reporting; data warehousing

The architecture of Blue Martini Software's e-commerce suite has supported data collection, data transformation, and data mining since its inception. With clickstreams being collected at the application-server layer, high-level events being logged, and data automatically transformed into a data warehouse using meta-data, common problems plaguing data mining using weblogs (e.g., sessionization and conflating multi-sourced data) were obviated, thus allowing us to concentrate on actual data mining goals. The paper briefly reviews the architecture and discusses many lessons learned over the last four years and the challenges that still need to be addressed. The lessons and challenges are presented across two dimensions: business-level vs. technical, and throughout the data mining lifecycle stages of data collection, data warehouse construction, business intelligence, and deployment. The lessons and challenges are also widely applicable to data mining domains outside retail e-commerce.
