4.7 Article

Analysis of outlier detection rules based on the ASHRAE global thermal comfort database

期刊

BUILDING AND ENVIRONMENT
卷 234, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.buildenv.2023.110155

关键词

Outlier detection; Thermal preference; ASHRAE global Thermal comfort database; Machine learning; Support vector machine

向作者/读者索取更多资源

This study aims to investigate the filter performance of different outlier detection methods. Results show that all three rules can filter some obvious outliers, and the Boxplot rule produces the most moderate filer results, whereas the 3-Sigma rule sometimes fails to detect outliers and the Hampel rule may provide an aggressive solution that causes a false alarm.
ASHRAE Global Thermal Comfort Database has been extensively used for analyzing specific thermal comfort parameters or models, evaluating subjective metrics, and integrating with machine learning algorithms. Outlier detection is regarded as an essential step in data preprocessing, but current publications related to this database paid less attention to the influence of outliers in raw datasets. This study aims to investigate the filter performance of different outlier detection methods. Three stochastic-based approaches have been performed and analyzed based on the example of predicting thermal preference using the Support Vector Machine (SVM) algorithm as a case study to compare the predictions before and after outlier removal. Results show that all three rules can filter some obvious outliers, and the Boxplot rule produces the most moderate filer results, whereas the 3-Sigma rule sometimes fails to detect outliers and the Hampel rule may provide an aggressive solution that causes a false alarm. It has also been discovered that a small reduction in establishing machine learning models can result in less complicated and smoother decision boundaries, which has the potential to provide more energyefficient and conflict-free solutions.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据