4.6 Article

Machine learning for pattern discovery in management research

期刊

STRATEGIC MANAGEMENT JOURNAL
卷 42, 期 1, 页码 30-57

出版社

WILEY
DOI: 10.1002/smj.3215

关键词

abduction; decision trees; exploratory data analysis; induction; machine learning; neural networks; partial dependence plots; pattern discovery; random forests; ROC curve; supervised machine learning

向作者/读者索取更多资源

Supervised machine learning methods are powerful tools for discovering robust patterns in quantitative data, but should not be viewed as conclusive proof of causal relationships. This study demonstrates the application of ML algorithms to investigate employee turnover, uncovering surprising patterns between variables through partial dependence plots. It provides guidance for evaluating model performance and warns of common misinterpretation pitfalls to help managers and researchers evaluating analysis conducted by data scientists in their organizations.
Research Summary Supervised machine learning (ML) methods are a powerful toolkit for discovering robust patterns in quantitative data. The patterns identified by ML could be used for exploratory inductive or abductive research, or for post hoc analysis of regression results to detect patterns that may have gone unnoticed. However, ML models should not be treated as the result of a deductive causal test. To demonstrate the application of ML for pattern discovery, we implement ML algorithms to study employee turnover at a large technology company. We interpret the relationships between variables using partial dependence plots, which uncover surprising nonlinear and interdependent patterns between variables that may have gone unnoticed using traditional methods. To guide readers evaluating ML for pattern discovery, we provide guidance for evaluating model performance, highlight human decisions in the process, and warn of common misinterpretation pitfalls. The Supporting Information section provides code and data to implement the algorithms demonstrated in this article. Managerial Summary Supervised machine learning (ML) methods are a powerful toolkit that might help managers and researchers discover interesting patterns in large and complex data. We demonstrate this by using several ML algorithms to investigate the drivers of employee turnover at a large technology company. We evaluate the performance of the models, and use visual tools to interpret the patterns revealed. These patterns can be useful in understanding turnover, but we caution not to confuse correlation with causation. These methods should be viewed as exploratory and not conclusive proof of relationships in the data. Our guidance can be helpful for managers evaluating analysis conducted by data scientists in their organizations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据