Article

Automated Identification of Domestic Violence in Written Child Welfare Records: Leveraging Text Mining and Machine Learning to Enhance Social Work Research and Evaluation

Journal

Publisher

UNIV CHICAGO PRESS
DOI: 10.1086/712734

Keywords

text mining; machine learning; data science; domestic violence; child welfare

Categories

Funding

  1. Casey Family Programs
  2. Michigan Department of Health and Human Services

This study demonstrates that text mining and machine learning procedures can effectively identify domestic violence issues in child welfare investigation summaries with over 90% accuracy. The reliability of machine learning models in supporting human reviewers was also confirmed. These methods offer a cost-effective solution for extracting meaningful insights from text data in social work research and evaluation, enhancing the use of text data in investigating domestic violence-related needs in the child welfare system.
Objective: Child welfare agencies often lack information about the front-end service needs of the families they serve. Thus, the current study tests the feasibility of text mining and machine learning procedures for identifying problems related to domestic violence documented in child welfare investigation summaries.
Method: We labeled child welfare investigation summaries (N = 1,402) for the presence or absence of an active domestic violence service need. Labeled documents were then used to develop text mining and machine learning models and test their accuracy and reliability.
Results: Machine learning models achieved greater than 90% accuracy when compared with human coders. Fleiss kappa estimates of coding reliability between the top-performing model and human reviewers exceeded .80, indicating that our model could support human reviewers to complete this coding task.
Conclusion: Results provide strong evidence that text mining and machine learning procedures can be a cost-effective solution for extracting meaningful insights from text data. Although unsuitable for case-level predictive analytics, insights derived from these procedures can be particularly useful for investigating the prevalence, temporal trends, and geographic distribution of domestic violence-related needs in the child welfare system. These methods could substantially enhance the use of text data in social work research and evaluation.
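
The two evaluation measures named in the abstract, accuracy against human coders and Fleiss kappa across several raters, can be illustrated with a short sketch. This is not the authors' code: the function names (`accuracy`, `to_counts`, `fleiss_kappa`) and the tiny label set are hypothetical and serve only to show how the statistics are computed when a model is treated as one more rater alongside human reviewers.

```python
import math


def accuracy(predicted, reference):
    """Share of items where the predicted label equals the reference label."""
    if len(predicted) != len(reference) or not reference:
        raise ValueError("label lists must be non-empty and of equal length")
    matches = sum(p == r for p, r in zip(predicted, reference))
    return matches / len(reference)


def to_counts(ratings, categories=(0, 1)):
    """Convert per-item rater labels into per-item category counts.

    ratings[i] holds the labels every rater gave to item i.
    """
    return [[list(row).count(c) for c in categories] for row in ratings]


def fleiss_kappa(counts):
    """Fleiss kappa for counts[i][j] = raters assigning item i to category j."""
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("every item needs the same number (>= 2) of ratings")

    # Observed agreement: mean over items of the pairwise rater agreement.
    per_item = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(per_item) / n_items

    # Chance agreement: sum of squared overall category proportions.
    n_categories = len(counts[0])
    proportions = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    p_expected = sum(p * p for p in proportions)

    if math.isclose(p_expected, 1.0):
        return float("nan")  # undefined when only one category is ever used
    return (p_observed - p_expected) / (1.0 - p_expected)


if __name__ == "__main__":
    # Made-up labels (1 = domestic violence need present, 0 = absent);
    # each row is (human reviewer A, human reviewer B, model) for one summary.
    ratings = [(1, 1, 1), (0, 0, 0), (1, 1, 0), (0, 0, 1)]
    print(fleiss_kappa(to_counts(ratings)))
    print(accuracy([r[2] for r in ratings], [r[0] for r in ratings]))
```

Treating the model as an additional rater, as in the last column of the illustrative data, is one way to read the reliability comparison the abstract describes; the study's actual rater setup is not detailed in the abstract.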
