4.4 Article

Logical big data integration and near real-time data analytics

期刊

DATA & KNOWLEDGE ENGINEERING
卷 146, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.datak.2023.102185

关键词

Big data integration; Distributed databases; Near real-time OLAP

向作者/读者索取更多资源

In the field of decision-making, there is a growing need for real-time data that traditional solutions cannot meet. Existing logical data integration solutions also present challenges as they require users to focus on data details rather than analytics. EasyBDI is an open-source system that offers logical data integration and business-oriented abstractions. It automatically identifies partitioned data and proposes a global schema, allowing users to retrieve data from distributed sources without technical knowledge. Experimental results show minimal overhead compared to distributed query execution times.
In the context of decision-making, there is a growing demand for near real-time data that traditional solutions, like data warehousing based on long-running ETL processes, cannot fully meet. On the other hand, existing logical data integration solutions are challenging because users must focus on data location and distribution details rather than on data analytics and decision-making. EasyBDI is an open-source system that provides logical integration of data and high-level business-oriented abstractions. It uses schema matching, integration, and mapping techniques, to automatically identify partitioned data and propose a global schema. Users can then specify star schemas based on global entities and submit analytical queries to retrieve data from distributed data sources without knowing the organization and other technical details of the underlying systems. This work presents the algorithms and methods for global schema creation and query execution. Experimental results show that the overhead imposed by logical integration layers is relatively small compared to the execution times of distributed queries.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据