4.5 Article

The generalizability of machine learning models of personality across two text domains

Journal

PERSONALITY AND INDIVIDUAL DIFFERENCES
Volume 217, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.paid.2023.112465

Keywords

Machine learning; Big Five; LIWC; Text analysis

Ask authors/readers for more resources

Machine learning of high-dimensional models can predict psychological variables accurately within specific domains, but their ability to generalize across domains is limited. Complex models are highly influenced by the specifics of the trained domain, and only a few predictors can survive in different domains.
Machine learning of high-dimensional models have received attention for their ability to predict psychological variables, such as personality. However, it has been less examined to what degree such models are capable of generalizing across domains. Across two text domains (Reddit message and personal essays), compared to lowdimensional- and theoretical models, atheoretical high-dimensional models provided superior predictive accuracy within but poor/non-significant predictive accuracy across domains. Thus, complex models depended more on the specifics of the trained domain. Further, when examining predictors of models, few survived across domains. We argue that theory remains important when conducting prediction-focused studies and that research on both high- and low-dimensional models benefit from establishing conditions under which they generalize.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available