4.0 Review

An Overview of Data Warehouse and Data Lake in Modern Enterprise Data Management

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Information Systems

Migrating a research data warehouse to a public cloud: challenges and opportunities

Michael G. Kahn et al.

Summary: Clinical research data warehouses are moving to cloud platforms for scalability and flexibility, but face challenges such as legacy system limitations and complex security reviews. Cloud architectures offer new capabilities, but rapid changes can lead to obsolete architectures and associated policies. Governance and cost oversight are critical for successful innovation in a cloud environment.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2022)

Proceedings Paper Computer Science, Artificial Intelligence

From Batch Processing to Real Time Analytics: Running Presto® at Scale

Zhenxiao Luo et al.

Summary: This paper introduces the important features and performance improvements of the open source Presto in recent years, enabling companies to run Presto at scale and support various use cases.

2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022) (2022)

Article Computer Science, Artificial Intelligence

On data lake architectures and metadata management

Pegdwende Sawadogo et al.

Summary: With the exponential increase in global data production over the past two decades, the concept of data lakes has been introduced as a solution to the challenges posed by big data. Data lake architectures and metadata management are key issues in successfully implementing data lakes. However, there is still confusion and ambiguity surrounding the concept of data lakes among many researchers and practitioners.

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS (2021)

Article Computer Science, Information Systems

The evolution of Amazon Redshift (extended abstract)

Ippokratis Pandis

PROCEEDINGS OF THE VLDB ENDOWMENT (2021)

Proceedings Paper Computer Science, Theory & Methods

Data Lake Approaches: A Survey

Elisabeta Zagan et al.

2020 15TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND APPLICATION SYSTEMS (DAS) (2020)

Article Computer Science, Information Systems

Privacy-enhancing ETL-processes for biomedical data

Fabian Prasser et al.

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS (2019)

Proceedings Paper Computer Science, Hardware & Architecture

A Data Warehouse Approach for Business Intelligence

Georgia Garani et al.

2019 IEEE 28TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES: INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE) (2019)

Proceedings Paper Computer Science, Information Systems

JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes

Erkang Zhu et al.

SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (2019)

Proceedings Paper Computer Science, Information Systems

Presto: SQL on Everything

Raghav Sethi et al.

2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019) (2019)

Article Computer Science, Information Systems

Data Lake Management: Challenges and Opportunities

Fatemeh Nargesian et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2019)

Article Computer Science, Information Systems

Juneau: Data Lake Management for Jupyter

Yi Zhang et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2019)

Proceedings Paper Engineering, Electrical & Electronic

Data lake: a new ideology in big data era

Pwint Phyu Khine et al.

4TH ANNUAL INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND SENSOR NETWORK (WCSN 2017) (2018)

Proceedings Paper Computer Science, Information Systems

Navigating the Data Lake with DATAMARAN: Automatically Extracting Structure from Log Datasets

Yihan Gao et al.

SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (2018)

Article Computer Science, Software Engineering

The Lambda and the Kappa

Jimmy Lin

IEEE INTERNET COMPUTING (2017)

Proceedings Paper Computer Science, Information Systems

CoreDB: a Data Lake Service

Amin Beheshti et al.

CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT (2017)

Proceedings Paper Computer Science, Information Systems

Azure Data Lake Store: A Hyperscale Distributed File Service for Big Data Analytics

Raghu Ramakrishnan et al.

SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (2017)

Proceedings Paper Computer Science, Hardware & Architecture

VOLAP: A Scalable Distributed System for Real-Time OLAP with High Velocity Data

Frank Dehne et al.

2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER) (2016)

Proceedings Paper Computer Science, Theory & Methods

Application of Big Data, Fast Data and Data Lake Concepts to Information Security Issues

Natalia Miloslavskaya et al.

2016 IEEE 4TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW) (2016)

Article Computer Science, Information Systems

A Cognitive Adopted Framework for IoT Big-Data Management and Knowledge Discovery Prospective

Nilamadhab Mishra et al.

INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS (2015)

Article Computer Science, Information Systems

A Cognitive Adopted Framework for IoT Big-Data Management and Knowledge Discovery Prospective

Nilamadhab Mishra et al.

INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS (2015)

Proceedings Paper Computer Science, Information Systems

Big Data Analytics as a Service for Business Intelligence

Zhaohao Sun et al.

OPEN AND BIG DATA MANAGEMENT AND INNOVATION, I3E 2015 (2015)

Article Computer Science, Artificial Intelligence

Significance and Challenges of Big Data Research

Xiaolong Jin et al.

BIG DATA RESEARCH (2015)

Article Information Science & Library Science

Beyond the hype: Big data concepts, methods, and analytics

Amir Gandomi et al.

INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT (2015)

Proceedings Paper Engineering, Electrical & Electronic

Solution for Data Growth Problem of MOLAP

Xu Jian et al.

MECHATRONICS AND INDUSTRIAL INFORMATICS, PTS 1-4 (2013)

Article Computer Science, Information Systems

Capturing summarizability with integrity constraints in OLAP

CA Hurtado et al.

ACM TRANSACTIONS ON DATABASE SYSTEMS (2005)