Article

Block Storage Optimization and Parallel Data Processing and Analysis of Product Big Data Based on the Hadoop Platform

MATHEMATICAL PROBLEMS IN ENGINEERING (2021)

Journal

MATHEMATICAL PROBLEMS IN ENGINEERING

Volume 2021, Issue -, Pages -

Publisher

HINDAWI LTD

DOI: 10.1155/2021/3839800

Categories

Engineering, Multidisciplinary; Mathematics, Interdisciplinary Applications

Abstract

Traditional distributed database storage architectures face efficiency and storage capacity problems when managing seafood product data resources. This study proposes optimization methods based on the Hadoop platform and the MapReduce model, using a consistent hashing algorithm and parallel processing strategies. The research focuses on data distribution, block size adjustment, and algorithms for efficiently managing big data resources related to seafood products.
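As a rough illustration of the consistent hashing idea mentioned above, the following is a minimal sketch of a hash ring that picks several distinct nodes for each data block. It shows only plain consistent hashing with virtual nodes, not the correlation-aware and spatio-temporal placement the study describes. The class name `HashRing`, the `vnodes` parameter, the node names, and the choice of MD5 are all assumptions made for this example.

```python
import bisect
import hashlib


def _hash(key: str) -> int:
    """Map a string to a point on the ring using MD5 (any stable hash would do)."""
    return int(hashlib.md5(key.encode("utf-8")).hexdigest(), 16)


class HashRing:
    """Hypothetical consistent-hash ring with virtual nodes (illustrative only)."""

    def __init__(self, nodes, vnodes=64):
        # Each physical node is placed on the ring at `vnodes` virtual points.
        self._ring = sorted(
            (_hash(f"{node}#{i}"), node) for node in nodes for i in range(vnodes)
        )
        self._points = [point for point, _ in self._ring]
        self._node_count = len(set(nodes))

    def replicas(self, key, count=3):
        """Return up to `count` distinct nodes found walking clockwise from the key."""
        count = min(count, self._node_count)
        start = bisect.bisect_right(self._points, _hash(key))
        chosen = []
        for offset in range(len(self._ring)):
            node = self._ring[(start + offset) % len(self._ring)][1]
            if node not in chosen:
                chosen.append(node)
                if len(chosen) == count:
                    break
        return chosen
```

With a ring such as `HashRing(["dn1", "dn2", "dn3", "dn4"])`, calling `replicas("block-001", 3)` returns three distinct node names. Removing one node only remaps the keys that were placed on it, which is the property that makes this scheme attractive for data distribution.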

Traditional distributed database storage architectures suffer from low efficiency and limited storage capacity when managing seafood product data resources. We review storage and retrieval technologies for big data resources, then propose a block storage layout optimization method based on the Hadoop platform and a parallel data processing and analysis method based on the MapReduce model. The parallel processing and analysis method uses a multireplica consistent hashing algorithm based on data correlation and spatial and temporal properties. The data distribution strategy and block size adjustment are studied on the Hadoop platform. A multidata-source parallel join query algorithm and a multichannel data fusion feature extraction algorithm, both built on data-optimized storage, are designed for seafood product big data within the MapReduce parallel framework. Practical verification shows that the storage optimization and data retrieval methods support the construction of a big data resource management platform for seafood products and enable efficient organization and management of these resources. The execution time of multidata-source parallel retrieval is only 32% of that of the standard Hadoop scheme, and the execution time of the multichannel data fusion feature extraction algorithm is only 35% of that of the standard Hadoop scheme.
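To make the idea of a parallel join over several data sources more concrete, here is a minimal sketch of a generic reduce-side join written as plain Python functions that imitate the map, shuffle, and reduce stages. It is not the algorithm from the paper and does not use the Hadoop API. The function names, the source tags, and the `key_field` parameter are assumptions introduced for illustration.

```python
from collections import defaultdict
from itertools import product


def map_phase(source_name, records, key_field):
    """Emit (join_key, (source_name, record)) pairs, tagging each record with its source."""
    for record in records:
        yield record[key_field], (source_name, record)


def shuffle(pairs):
    """Group mapper output by key, as a MapReduce framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(tagged_records, left, right):
    """Combine every left-source record with every right-source record for one key."""
    left_rows = [rec for src, rec in tagged_records if src == left]
    right_rows = [rec for src, rec in tagged_records if src == right]
    for l_row, r_row in product(left_rows, right_rows):
        yield {**l_row, **r_row}


def parallel_join(left, left_rows, right, right_rows, key_field):
    """Inner-join two record lists on `key_field` using the three stages above."""
    pairs = list(map_phase(left, left_rows, key_field))
    pairs += list(map_phase(right, right_rows, key_field))
    joined = []
    for values in shuffle(pairs).values():
        joined.extend(reduce_phase(values, left, right))
    return joined
```

In a real cluster each key group would be handled by a separate reducer, so the groups can be processed in parallel. Here they run in a single loop, which keeps the sketch runnable without Hadoop.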
