Article

Text Mining in 19th-Century Essays for Investigating a Possible Collaborative Authorship Problem: John Stuart Mill and Harriet Taylor Mill

Journal

IEEE ACCESS
Volume 10, Pages 20937-20947

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2022.3152201

Keywords

Feature extraction; Training; Writing; Task analysis; Reliability; Syntactics; Text mining; Authorship attribution; text classification; machine learning; feature selection

This study uses machine learning techniques to investigate the authorship of two famous nineteenth-century essays. The classifiers trained in this research indicate that John Stuart Mill is the primary author of the essays, but they also highlight Harriet Taylor Mill's contribution to certain portions of the text.
In this work, we use machine learning techniques to address a research question regarding the authorship of two famous nineteenth-century essays. On Liberty (1859) and The Subjection of Women (1869) were published under the name of John Stuart Mill, a widely studied nineteenth-century British philosopher. Mill himself attributed them to collaboration with his wife and partner, Harriet Taylor Mill. More than 150 years later, the question remains whether John Stuart Mill was the sole author of these two canonical texts in the history of political thought. Experts are divided on whether to take John Stuart Mill's attribution at face value, since Harriet Taylor Mill had died in 1858. To address this question, we use a dataset consisting of essays by both authors to train three state-of-the-art classifiers that learn to distinguish the writing style of each author. We then use the resulting models to attribute the two essays of disputed authorship to one of the two authors. The classifiers learn the two classes very well and return high accuracies on the validation set. On the test set, most of the models attribute the two essays to John Stuart Mill; however, some chunks of text in both essays are attributed to Harriet Taylor Mill. These results, we conclude, explain why experts are divided on this particular research question.
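The chunk-level attribution described in the abstract can be illustrated with a minimal sketch. The abstract does not name the three classifiers or the features used, so everything below is an assumption for illustration only: a nearest-centroid rule over function-word frequencies stands in for the trained classifiers, and the function-word list, the chunk size, the helper names, and the toy texts labelled "author_A" and "author_B" are all hypothetical.

```python
from collections import Counter

# Hypothetical feature set: a small list of English function words.
FUNCTION_WORDS = ["the", "of", "and", "to", "in", "that", "it", "is", "which", "but"]


def chunk_words(text, size):
    """Split text into consecutive chunks of `size` words; a trailing partial chunk is dropped."""
    words = text.lower().split()
    return [words[i:i + size] for i in range(0, len(words) - size + 1, size)]


def features(words):
    """Relative frequency of each function word within one chunk."""
    counts = Counter(words)
    return [counts[w] / len(words) for w in FUNCTION_WORDS]


def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def train(labelled_texts, size):
    """Map each author to the mean feature vector of that author's chunks."""
    return {
        author: centroid([features(c) for c in chunk_words(text, size)])
        for author, text in labelled_texts.items()
    }


def sq_dist(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def attribute(model, text, size):
    """Label every chunk with the nearest author centroid and tally the votes."""
    labels = [
        min(model, key=lambda author: sq_dist(model[author], features(chunk)))
        for chunk in chunk_words(text, size)
    ]
    return labels, Counter(labels)


# Synthetic placeholder texts (not real essays), each sentence exactly 10 words long.
STYLE_A = "the law of the land and the will of people "
STYLE_B = "it is said that which is true but it is "

model = train({"author_A": STYLE_A * 20, "author_B": STYLE_B * 20}, size=10)
labels, votes = attribute(model, STYLE_A * 3 + STYLE_B, size=10)
print(labels)
print(votes.most_common())
```

In this toy setup the majority vote gives the overall attribution, while the per-chunk labels show where the second author's style appears, which mirrors the kind of mixed result the abstract reports.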
