☆ 4.6 Article

Climate Change Sentiment Analysis Using Lexicon, Machine Learning and Hybrid Approaches

SUSTAINABILITY (2022)

期刊

SUSTAINABILITY

卷 14, 期 8, 页码 -

出版社

MDPI

DOI: 10.3390/su14084723

关键词

climate change; sentiment analysis; lexicon; machine learning; social media

类别

Green & Sustainable Science & Technology Environmental Sciences Environmental Studies

资金

Ministry of Higher Education, Malaysia
Institute of Research, Management and Innovation, Universiti Teknologi MARA under the Fundamental Research Grant Scheme [600-IRMI/FRGS 5/3 (370/2019)]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study aimed to find the most effective sentiment analysis approach for climate change tweets and related domains. The results showed that the hybrid method outperformed other approaches, with the hybrid TextBlob and Logistic Regression achieving the highest F1-score of 75.3%. The study also found that lemmatization improved the accuracy of machine learning and hybrid approaches, while the TF-IDF feature extraction technique slightly outperformed Bag-of-Words.

The emissions of greenhouse gases, such as carbon dioxide, into the biosphere have the consequence of warming up the planet, hence the existence of climate change. Sentiment analysis has been a popular subject and there has been a plethora of research conducted in this area in recent decades, typically on social media platforms such as Twitter, due to the proliferation of data generated today during discussions on climate change. However, there is not much research on the performances of different sentiment analysis approaches using lexicon, machine learning and hybrid methods, particularly within this domain-specific sentiment. This study aims to find the most effective sentiment analysis approach for climate change tweets and related domains by performing a comparative evaluation of various sentiment analysis approaches. In this context, seven lexicon-based approaches were used, namely SentiWordNet, TextBlob, VADER, SentiStrength, Hu and Liu, MPQA, and WKWSCI. Meanwhile, three machine learning classifiers were used, namely Support Vector Machine, Naive Bayes, and Logistic Regression, by using two feature extraction techniques, which were Bag-of-Words and TF-IDF. Next, the hybridization between lexicon-based and machine learning-based approaches was performed. The results indicate that the hybrid method outperformed the other two approaches, with hybrid TextBlob and Logistic Regression achieving an F1-score of 75.3%; thus, this has been chosen as the most effective approach. This study also found that lemmatization improved the accuracy of machine learning and hybrid approaches by 1.6%. Meanwhile, the TF-IDF feature extraction technique was slightly better than BoW by increasing the accuracy of the Logistic Regression classifier by 0.6%. However, TF-IDF and BoW had an identical effect on SVM and NB. Future works will include investigating the suitability of deep learning approaches toward this domain-specific sentiment on social media platforms.

Climate Change Sentiment Analysis Using Lexicon, Machine Learning and Hybrid Approaches

期刊

SUSTAINABILITY

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Climate Change Sentiment Analysis Using Lexicon, Machine Learning and Hybrid Approaches

期刊

SUSTAINABILITY

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文