4.5 Article

Social Media and Stock Market Prediction: A Big Data Approach

期刊

CMC-COMPUTERS MATERIALS & CONTINUA
卷 67, 期 2, 页码 2569-2583

出版社

TECH SCIENCE PRESS
DOI: 10.32604/cmc.2021.014253

关键词

Big data; analytics; artificial intelligence; machine learning; stock market; social media; business analytics

向作者/读者索取更多资源

Big data involves the collection of large datasets to identify trends and patterns, with challenges including the volume and variety of data. In this paper, machine learning models were applied to analyze stock data of 10 companies, with linear regression, random forest, and generalized linear regression showing accuracy rates of 80%-98%.
Big data is the collection of large datasets from traditional and digital sources to identify trends and patterns. The quantity and variety of computer data are growing exponentially for many reasons. For example, retailers are building vast databases of customer sales activity. Organizations are working on logistics financial services, and public social media are sharing a vast quantity of sentiments related to sales price and products. Challenges of big data include volume and variety in both structured and unstructured data. In this paper, we implemented several machine learning models through Spark MLlib using PySpark, which is scalable, fast, easily integrated with other tools, and has better performance than the traditional models. We studied the stocks of 10 top companies, whose data include historical stock prices, with MLlib models such as linear regression, generalized linear regression, random forest, and decision tree. We implemented naive Bayes and logistic regression classification models. Experimental results suggest that linear regression, random forest, and generalized linear regression provide an accuracy of 80%-98%. The experimental results of the decision tree did not well predict share price movements in the stock market.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据