4.6 Article

Toward a prediction approach based on deep learning in Big Data analytics

期刊

NEURAL COMPUTING & APPLICATIONS
卷 35, 期 8, 页码 6043-6063

出版社

SPRINGER LONDON LTD
DOI: 10.1007/s00521-022-07986-9

关键词

Big Data analytics; Deep learning; GloVe; Hadoop; MapReduce

向作者/读者索取更多资源

Cloud computing plays a crucial role in storing and processing Big Data, and deep learning has been widely adopted to extract predictions from user opinions. This paper introduces a new prediction approach, PABIDDL, based on Big Data analysis and deep learning, which achieves high performance on large-scale datasets.
Nowadays, cloud computing plays an important role in the process of storing both structured and unstructured data. This contributed to a very large data growth on web servers, which has come to be called Big Data. Cloud computing technology is adopted in many applications, perhaps the most important of which are social networking applications, e-mail messages, and others, which represent an important source of data through the process of communication between web users. Thus, these data represent views and opinions on various topics, which can help businesses and other decision makers in making decisions based on future predictions. To achieve this goal, several methods have been proposed. Recently, it relies on the use of deep learning as a tool for processing large volumes of data due to its high performance in extracting predictions from the opinions of web users. This paper presents a new prediction approach based on Big Data analysis and deep learning for large-scale data, called PABIDDL. The infrastructure of the proposed approach is focused on three important stages, starting with the reduction of Big Data based on MapReduce using the Hadoop framework. In the second stage, we performed the initialization of these data using the GloVe technique. Finally, the text data were classified into advantages and disadvantages poles depending on CNN deep learning approach. Also, we conducted an empirical study of our proposed approach PABIDDL and related works models on two standard datasets IMDB and MR datasets. The results obtained showed that the best performance is given by our approach. We recorded 0.93%, 0.90%, and 0.92% as an accuracy, a recall, and an F1-score, respectively. Furthermore, our approach reached the fastest response time.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据